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Abstract 

We present several new results about smoothed analysis of multiobjective optimization 
problems. Motivated by the discrepancy between worst-case analysis and practical experi- 
ence, this line of research has gained a lot of attention in the last decade. We consider 
f \ , problems in which d linear and one arbitrary objective function are to be optimized over a 

set <S C {0, l}' 1 of feasible solutions. We improve the previously best known bound for the 

smoothed number of Pareto-optimal solutions to 0(n 2d cj) d ), where <j> denotes the perturbation 
parameter. Additionally, we show that for any constant c the c-th moment of the smoothed 
number of Pareto-optimal solutions is bounded by 0{{n 2d <f> d Y). This improves the previously 
^**^ , best known bounds significantly. Furthermore, we address the criticism that the perturbations 

in smoothed analysis destroy the zero-structure of problems by showing that the smoothed 
^ ' number of Pareto-optimal solutions remains polynomially bounded even for zero-preserving 

perturbations. This broadens the class of problems captured by smoothed analysis and it 
has consequences for non-linear objective functions. One corollary of our result is that the 
smoothed number of Pareto-optimal solutions is polynomially bounded for polynomial objec- 
. tive functions. 

1 Introduction 

In most real-life decision-making problems there is more than one objective to be optimized. For 
example, when booking a train ticket, one wishes to minimize the travel time, the fare, and 
the number of train changes. As different objectives are often conflicting, usually no solution is 
simultaneously optimal in all criteria and one has to make a trade-off between different objectives. 
The most common way to filter out unreasonable trade-offs and to reduce the number of solutions 
the decision maker has to choose from is to determine the set of Pareto-optimal solutions, where a 
solution is called Pareto-optimal if no other solution is simultaneously better in all criteria. 

Multiobjective optimization problems have been studied extensively in operations research and 
theoretical computer science (see, e.g., [5] for a comprehensive survey). In particular, many algo- 
rithms for generating the set of Pareto-optimal solutions for various optimization problems such 
as the (bounded) knapsack problem [THJ [T^] , the multiobjective shortest path problem [Tl fTTIfT^] . 
and the multiobjective network flow problem [8j [15] have been proposed. Enumerating the set 
of Pareto-optimal solutions is not only used as a preprocessing step to eliminate unreasonable 
trade-offs, but often it is also used as an intermediate step in algorithms for solving optimization 
problems. For example, the Nemhauser-Ullmann algorithm [16] treats the single-criterion knap- 
sack problem as a bicriteria optimization problem in which a solution with small weight and large 
profit is sought, and it generates the set of Pareto-optimal solutions, ignoring the given capacity 
of the knapsack. After this set has been generated, the algorithm returns the solution with the 
highest profit among all Pareto-optimal solutions with weight not exceeding the knapsack capacity. 
This solution is optimal for the given instance of the knapsack problem. 
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Generating the set of Pareto-optimal solutions (a.k.a. the Pareto set) makes only sense if few 
solutions are Pareto-optimal. Otherwise, it is too costly and it does not provide enough guidance 
to the decision maker. While, in many applications, it has been observed that the Pareto set is 
indeed usually small (see, e.g, [H] for an experimental study of the multiobjective shortest path 
problem), one can, for almost every problem with more than one objective function, easily find 
instances with an exponential number of Pareto-optimal solutions (see, e.g., [S]). 

Motivated by the discrepancy between worst-case analysis and practical observations, smoothed 
analysis of multiobjective optimization problems has gained a lot of attention in the last decade. 
Smoothed analysis is a framework for judging the performance of algorithms that has been proposed 
in 2001 by Spielman and Teng [20 in order to explain why the simplex algorithm is efficient in 
practice even though it has an exponential worst-case running time. In this framework, inputs are 
generated in two steps: first, an adversary chooses an arbitrary instance, and then this instance is 
slightly perturbed at random. The smoothed performance of an algorithm is defined to be the worst 
expected performance the adversary can achieve. This model can be viewed as a less pessimistic 
worst-case analysis, in which the randomness rules out pathological worst-case instances that are 
rarely observed in practice but dominate the worst-case analysis. If the smoothed running time of 
an algorithm is low and inputs are subject to a small amount of random noise, then it is unlikely to 
encounter an instance on which the algorithm performs poorly. In practice, random noise can stem 
from measurement errors, numerical imprecision or rounding errors. It can also model arbitrary 
influences, which we cannot quantify exactly, but for which there is also no reason to believe that 
they are adversarial. 

After its invention in 2001, smoothed analysis has been successfully applied in a variety of 
contexts, e.g., to explain the practical success of local search methods, heuristics for the knapsack 
problem, online algorithms, and clustering. A recent survey by Spielman and Teng [21] summarizes 
some of these results. One of the areas in which smoothed analysis has been applied extensively is 
multiobjective optimization. In 2003 Bcicr and Vocking 13 : initiated this line of research by showing 
that the smoothed number of Pareto-optimal solutions is polynomially bounded for any linear 
binary optimization problem with two objective functions. This was the first rigorous explanation 
why heuristics for generating the set of Pareto-optimal solutions are successful in practice despite 
their bad worst-case behavior. In the last years, Beier and Vocking's original result has been 
improved and extended significantly in a series of papers. A discussion of this work follows in the 
next section after the formal description of the model. 

1.1 Model and Previous Work 

We consider a very general model of multiobjective optimization problems. An instance of such a 
problem consists of d+ 1 objective functions V 1 , . . . , V d+1 that are to be optimized over a set S C 
{0, 1}™ of feasible solutions. While the set S and the last objective function V d+1 : S — > R can be 
arbitrary, the first d objective functions have to be linear of the form V t (x) = Vfxi + . . . + V%x n 
for x — (x%, . . . , x n ) G S and t G {l,...,d}. We assume without loss of generality that all 
objectives are to be minimized and we call a solution x £ S Pareto-optimal if there does not exist 
a solution y G S with V t (y) < V t (x) for all t G {l,...,d+ 1} and V t (y) < V t (x) for at least 
one t G {1, . . . ,d + 1}. 

If one is allowed to choose the set S, the objective function V d+1 , and the coefficients of the lin- 
ear objective functions arbitrarily, then, even for d = 1, one can easily construct instances with an 
exponential number of Pareto-optimal solutions. For this reason Beier and Vocking introduced the 
model of <p-smooth instances j3] , in which an adversary can choose the set S and the objective func- 
tion V d+1 arbitrarily while he can only specify a probability density function ft: [—1,1] — > [0,<fi] 
for each coefficient V* according to which it is chosen independently from the other coefficients. 
This model is more general than Spielman and Teng's original two-step model in which the adver- 
sary first chooses coefficients which are afterwards subject to Gaussian perturbations. In 0-smooth 
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instances the adversary can additionally determine the type of noise. He could, for example, spec- 
ify for each coefficient an interval of length l/<j> from which it is chosen uniformly at random. 
The parameter cf> > 1 can be seen as a measure for the power of the adversary: the larger <f> the 
more precisely he can specify the coefficients of the linear objective functions. The aforementioned 
example of uniform distributions shows that for (f> — > oo smoothed analysis becomes a worst-case 
analysis. 

The smoothed number of Pareto- optimal solutions depends on the number n of binary variables 
and the perturbation parameter 0. It is defined to be the largest expected number of Pareto- 
optimal solutions the adversary can achieve by any choice of S C {0,1}", V d+1 : S -» R, and 
the densities /* : [—1,1] — > [0,4>]. In the following we assume that the adversary has made ar- 
bitrary fixed choices for these entities. Then we can associate with every matrix V G W ixn the 
number PO(V) of Pareto-optimal solutions in S when the coefficients V* of the d linear objective 
functions take the values given in V. Assuming that the adversary has made worst-case choices 
for S, V d+1 , and the densities /*, the smoothed number of Pareto-optimal solutions is the expected 
value Ey [PO(V)], where the coefficients in V are chosen according to the densities /'. For c > 1, 
we call Ev [PO c (V)] the c-th moment of the smoothed number of Pareto-optimal solutions. 

Beier and Vocking [3] showed that for the bicriteria case (i.e., d = 1) the smoothed number of 
Pareto-optimal solutions is (3(n 4 (/>) and f2(n 2 ). The upper bound was later simplified and improved 
by Beier et al. [2] to 0(n 2 4>)- In his PhD thesis pQ, Beier conjectured that the smoothed number of 
Pareto-optimal solutions is polynomially bounded in n and 4> for every constant d. This conjecture 
was proven by Roglin and Teng [17] , who showed that for any fixed d > 1, the smoothed number 
of Pareto-optimal solutions is 0((n 2 </>)-^ d )), where the function / is roughly f(d) = 2 d d\. They 
also proved that for any constant c the c-th moment of the smoothed number of Pareto-optimal 
solutions is bounded by 0({n 2 (f)) c '^ d%> ). Recently, Moitra and O'Donnell [13] improved the bound 
for the smoothed number of Pareto-optimal solutions significantly to 0(n 2d (f> d ( d+1 ^ 2 ) . The bound 
for the moments could, however, not be improved by their methods. Very recently a combination 
of results by Goyal and Rademacher [TU] and our results in [5] was observed to yield a lower bound 
ofn{n d - 15 (j) d ) pH]. 

1.2 Our Results 

In this paper, we present several new results about smoothed analysis of multiobjective binary 
optimization problems. Besides general 0-smooth instances, we consider for some results only 
the special case of quasiconcave density functions. This means that we assume that every coef- 
ficient V* is chosen independently according to its own density function /*: [—1,1] — > [0,<fi] with 
the additional requirement that for every density ft there is a value x\ £ [—1,1] such that ft is 
non-decreasing in the interval [— l,xt] and non- increasing in the interval \x\, 1]. We do not think 
that this is a severe restriction because all natural perturbation models, like Gaussian or uniform 
perturbations, use quasiconcave density functions. Furthermore, quasiconcave densities capture 
the essence of a perturbation: each coefficient V* has an unperturbed value x\ and the probability 
that the perturbed coefficient takes a value z becomes smaller with increasing distance \z — x\\. 
We will call these instances quasiconcave <f>-smooth instances in the following. 

Beier and Vocking originally only considered </>-smooth instances for bicriteria optimization 
problems (i.e., for the case d = 1). The above described canonical generalization of this model to 
multiobjective optimization problems, on which Roglin and Teng's [T7] and Moitra and O'Donnell's 
results [13] are based, appears to be very general and flexible on the first glance. However, one 
aspect limits its applicability severely and makes it impossible to formulate certain multiobjective 
linear optimization problems in this model. The weak point of the model is that it assumes that 
every binary variable x\ appears in every linear objective function as it is not possible to set some 
coefficients V/ deterministically to zero. 

Already Spielman and Teng [50] and Beier and Vocking [3] observed that the zeros often encode 
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an essential part of the combinatorial structure of a problem and they suggested to analyze zero- 
preserving perturbations in which it is possible to either choose a density /* according to which 
the coefficient V? is chosen or to set it deterministically to zero. Zero-preserving perturbations 
have been studied in |18] and [1] for analyzing smoothed condition numbers of matrices and the 
smoothed complexity of binary optimization problems. For the smoothed number of Pareto-optimal 
solutions no upper bounds are known that are valid for zero-preserving perturbations (except trivial 
worst-case bounds), and in particular the bounds proven in [17] and [13] do not seem to generalize 
easily to zero-preserving perturbations. In this paper, we develop new techniques for analyzing the 
smoothed number of Pareto-optimal solutions that can also be used for analyzing zero-preserving 
perturbations. 

Theorem 1. For any d > 1, the smoothed number of Pareto-optimal solutions is 0(n d +d +d (f> d ) 
for quasiconcave (p-smooth instances with zero-preserving perturbations and 0((n<p) d +d +d ) for 
general cp-smooth instances with zero-preserving perturbations. 

In Section 11.31 we will present some applications of zero-preserving perturbations. We will see 
that they allow us not only to extend the smoothed analysis to linear multiobjective optimization 
problems that are not captured by the previous model without zero-preserving perturbations, but 
that they also enable us to bound the smoothed number of Pareto-optimal solutions in problems 
with non-linear objective functions. In particular, the number of Pareto-optimal solutions for 
polynomial objective functions can be bounded by Theorem [T] We say that a ^-smooth instance 
has polynomial objective functions if every objective function V , t G {1, . . . ,d}, is the weighted 
sum of at most m monomials, where the adversary can specify a (^-bounded density on [0, 1] for 
every weight according to which it is chosen. 

Corollary 2. For any d > 1, the smoothed number of Pareto-optimal solutions is 0(m d +d +d (j) d ) 
for quasiconcave (p-smooth instances with zero-preserving perturbations and polynomial objective 
functions. For general (p-smooth instances with zero-preserving perturbations and polynomial ob- 
jective functions the smoothed number of Pareto-optimal solutions is 0((m<p) d +d +d ). 

In addition to zero-preserving perturbations we also study the standard model of 0-smooth 
instances. We present significantly improved bounds for the smoothed number of Pareto-optimal 
solutions and the moments, answering two questions posed by Moitra and O'Donnell |13j . 

Theorem 3. For any d > 1, the smoothed number of Pareto-optimal solutions is 0(n 2d cp d ) for 
quasiconcave <j>-smooth instances. 

This improves the previously best known bound of 0(n 2d d ( d+1 )/ 2 ) (which is, however, valid 
also for non-quasiconcave densities) and it answers a question posed by Moitra and O'Donnell 
whether it is possible to improve the factor of d ( d+1 )/ 2 in their bound [13]. Together with the 
recent lower bound of tt(n d ~ 1 - 5 <p d ) [TO], which is also valid for quasiconcave density functions, this 
shows that the exponents of both n and </> are linear in d. 

Theorem 4. For any d > 1 and any constant c£N, the c-th moment of the smoothed number of 
Pareto-optimal solutions is 0({n 2d (p d ) c ) for quasiconcave (p-smooth instances and 0((n 2d ^> rf ( d+1 )) c ) 
for general (p-smooth instances. 

This answers a question in |13j whether it is possible to improve the bounds for the moments 
in |17j and it yields better concentration bounds for the smoothed number of Pareto-optimal 
solutions. Our results also have immediate consequences for the expected running times of various 
algorithms because most heuristics for generating the Pareto set of some problem (including the 
ones mentioned at the beginning of the introduction) have a running time that depends linearly 
or quadratically on the size of the Pareto set. The improved bounds on the smoothed number of 
Pareto-optimal solutions and the second moment of this number yield improved bounds on the 
smoothed running times of these algorithms. 
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1.3 Applications of Zero-preserving Perturbations 

Let us first of all remark that in the bicriteria case, which was studied in [3], zero-preserving 
perturbations are not more powerful than other perturbations because they can be simulated by 
the right choice of S C {0, 1}" and the objective function V 2 : S — > R. Assume, for example, that 
the adversary has chosen S and V 2 and has decided that the first coefficient V± of the first objective 
function should be deterministically set to zero. Also assume without loss of generality that V 2 
assigns unique values to the solutions in S. If this is not the case, we can make V 2 injective without 
reducing the smoothed number of Pareto-optimal solutions. We can partition the set S in classes 
of solutions that agree in all components except for the first one. That is, two solutions x G S 
and y & S belong to the same class if Xj = j/j for all i G {2, . . . , n}. All solutions in the same class 
have the same value in the first objective V 1 as they differ only in the binary variable Xi, whose 
coefficient has been set to zero. We construct a new set of solutions S' that contains for every class 
only the solution with smallest value in V 2 . It is easy to see that the number of Pareto-optimal 
solutions is the same with respect to S and with respect to S' because all solutions in S \ S 1 are 
dominated by solutions in S' . 

Then we transform the set S' C {0,1}" into a set S" C {0, l}" -1 by dropping the first 
component of every solution. Furthermore, we define a function W 2 : S" — > K that assigns to 
every solution x G S" the same value that V 2 assigns to the corresponding solution in S' . One 
can easily verify that the Pareto set with respect to S 1 and V 2 is identical with the Pareto-set 
with respect to <S" and W 2 . The only difference is that in the latter problem we have eliminated 
the coefficient that is deterministically set to zero. Such an easy reduction of zero-preserving 
perturbations to other perturbations does not seem to be possible for d > 2 anymore. 

Path Trading 

Berger et al. [5] study a model for routing in networks. In their model there is a graph G = (V, E) 
whose vertex set V is partitioned into mutually disjoint sets V\ , . . . , Vk- We can think of G as 
the Internet graph whose vertices are owned and controlled by k different autonomous systems 
(AScs). We denote by Ei C E the set of edges inside V,. The graph G is undirected, and each 
edge e G E has a length £ e G K>o- The traffic is modeled by a set of requests, where each request is 
characterized by its source node s G V and its target node t G V. We consider the case that a single 
request has to be routed from a source s G V\ to a target t G Vk- The Border Gateway Protocol 
(BGP) determines for this request the order in which it has to be routed through the ASes. We 
assume without loss of generality that the request has to be routed through V% , V% , . . . , Vk in this 
order, and we say that a path P is valid if it connects s to f and visits the ASes in the order 
specified by the BGP protocol. This means that the first AS has to choose a path Pi inside V\ 
from s to some node in V\ that is connected to some node V2 G V%. Then the second AS has to 
choose a path Pi inside V2 from i>2 to some node in V2 that is connected to some node V3 G V3 
and so on. For simplicity, the costs of routing a packet between two ASes are assumed to be zero, 
whereas AS i incurs costs of X) e eP ^ e ^ or rou tuig the packet inside Vi along path Pj. In the common 
hot-potato routing, every AS is only interested in minimizing its own costs. To model this, there 
are k objective functions that map each valid path P to a cost vector (Ci(P), . . . , Cfc(P)), where 

Ci(P)= £ * far»€{l,...,fc}. 

eePtlEi 

In [5] the problem of path trading is considered. This problem considers the question whether 
ASes can reduce their costs if they deviate from the hot-potato strategy and coordinate the chosen 
paths for multiple requests. A main ingredient in the algorithm proposed in [5] to solve this problem 
is to generate the set of Pareto-optimal valid paths with respect to the k objective functions defined 
above. As this Pareto set can be exponentially large in the worst case, Berger et al. proposed to 
study (^-smooth instances in which an adversary chooses the graph G and a density f e : [0, 1] — > 
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[0, 4>] for every edge length £ e according to which it is chosen. It seems as if we could easily 
apply the results in [17] and [13] to bound the smoothed number of Pareto-optimal paths. If we 
set S C {0, ljl^l to be the set of incidence vectors of valid paths, then all objective functions Ci 
are linear in the binary variables x e , e G E. For x G S, we have 

Ci(x) = ^ ieXe ■ 

Note, however, that different objective functions contain different variables x e because the coeffi- 
cients of all x e with e ^ Ei are set to zero in Ci. This is an important combinatorial property of 
the path trading problem that has to be obeyed. In the model in [17] and [13] it is not possible to 
set coefficients dctcrministically to zero. The best we can do is to replace each zero by a uniform 
density on the interval [0,1/0]. Then, however, an AS would incur positive costs for any edge 
that is used and not only for its own edges, which does not resemble the structure of the problem. 
Theorem [TJ which allows zero-preserving perturbations, yields immediately the following result. 

Corollary 5. The smoothed number of Pareto-optimal valid paths is polynomially bounded in \E\ 
and 4> for any constant k. 



Non-linear Objective Functions 

Even though we assumed above that the objective functions V 1 , . . . , V d are linear, we can also 
extend the smoothed analysis to non-linear objective functions. We consider first the bicriteria 
case d = 1. As above, we assume that the adversary has chosen a set S C {0,1}™ of feasi- 
ble solutions and an arbitrary injective objective function V 2 : S — > R. In addition to that the 
adversary can choose mi arbitrary functions if: S — > {0,1}, i G {1, ...,mi}. The objective 
function V 1 : S — >• K is defined to be a weighted sum of the functions 1\: 

mi 

v\x)=^in^), 

where each weight w\ is randomly chosen according to a density f\ : [0,1] — > [0,4>] chosen by the 
adversary. There is a wide variety of functions V (x) that can be expressed in this way. We can, 
for example, express every polynomial if we let I\, ... , 2^ be its monomials. 

We can linearize the problem by introducing a binary variable for every function l\ . Then the 
set of feasible solutions becomes 

S' = [y G {0, l}" 11 | 3x g S : Vt e {1, . . . ,mi} : l}{x) = Vl ) . 

For this set of feasible solutions we define W 1 : S' — > R and W 2 : S' — » K as follows: 

mi 

W 1 (y)=J2 w jVj and W 2 (y)=mm{v 2 (x) | Vi G {1, . . . ,mi} : //(x) = W } . 

3=1 

One can easily verify that the problem defined by S, V 1 , and V 2 and the problem defined by 5', 
W 1 , and W 2 are equivalent and have the same number of Pareto-optimal solutions. The latter 
problem is linear and hence we can apply the result by Beier et al. [2], which yields that the 
smoothed number of Pareto-optimal solutions is bounded by 0(m\4>). This shows in particular 
that the smoothed number of Pareto-optimal solutions is polynomially bounded in the number of 
monomials and the density parameter for every polynomial objective function V 1 . 

We can easily extend these considerations to multiobjective problems with d > 2. For these 
problems the adversary chooses a set S C {0,1}™, numbers mi, . . . ,rrid G N, and an arbi- 
trary injective objective function V d+1 : S — > R. In addition to that he chooses arbitrary func- 
tions If: S —> {0, 1} for t G {1, . . . , d} and i G {1, . . . , m t }. Every objective function V* ; S — > R is 
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a weighted sum of the functions I t 

m t 
i=l 

where each weight w\ is randomly chosen according to a density /* : [0, 1] — > [0, (f>] chosen by 
the adversary. Similar as the bicriteria case, also this problem can be linearized. However, the 
previous results about the smoothed number of Pareto-optimal solutions can only be applied if 
every objective function V* is composed of exactly the same functions If. Theorem [T] implies that 
the smoothed number of Pareto-optimal solutions is polynomially bounded in m i arL d 4>, for any 
choice of the 1\ . 

Outline 

After introducing some notation in the next section, we present an outline of our approach and 
our methods in Section [21 In Section 0] we prove Theorems [3] and 0] In Section [5] we consider zero- 
preserving perturbations and prove Theorem[TJ We conclude the paper with some open questions. 

2 Notation 

For the sake of simplicity we write V t x instead of V t (x), even for the adversarial objective V d+1 . 
With y k i--- k t x we re fer to the vector (V kl x, . . . , V *x). In our analysis, we will shift the solu- 
tions x G S by a certain vector u G {0, 1}™ an consider the values V t ■ (x — u). For the linear objec- 
tives we mean the value V t x — V t u, where V t u is well-defined even for a shift vector u G {0, 1}™ \<S. 
For the adversarial objective, however, we define V d+1 ■(x — u) := V d+1 x. It should not be confused 
with V d+1 y for y — x — u. In the whole paper let e > be a small real for which 1/e is integral. 
Let b = (bx, . . . , bd) G R d be a vector such that bk is an integral multiple of e for any k. We will call 
the set B — {(yx, ■ ■ . , yd) G M. d : yf. G (bk, bk + s] for any fc} an e-box and b the corner of B. For 
a vector x G { — 1, 0, 1} the expression By(x) denotes the unique e-box B for which V 1 '" d x G B. 
We call B the e-box of x and say that x lies in B. With M £ we denote the set of all e-boxes having 
corners b for which b G {— n/e, . . . ,n/e — l} d . Hence, |B e | = (2n/e) d . If all coefficients of V are 
from [—1,1], which is true for any of the models considered in this paper, and if for any k = 1, . . . , d 
there is an index i such that |^ fc | < 1, which holds with probability 1 in any of our models, then 
the e-box of any vector x G { — 1, 0, 1}™ belongs to B e . Note, that all vectors x constructed in this 
paper are from {—1,0, 1}". Hence, without any further explanation we will use that By{x) G B e . 

In this paper we extensively use tuples instead of sets. The reason for this is that we are not only 
interested in certain components of a vector or a matrix, but we also want to describe in which order 
they are considered. This will be clear after the introduction of the following notation. Let n, m 
be positive integers and let a±, . . . , a„, b\,...,b m be arbitrary reals. We define [n] — (1, . . . , n), 
[n] = (0,1,..., n), (ai, . . . ,a„) =n and (at, . . . , a n ) U (bx, . . . , b m ) = (a x , ■ ■ ■ ,a n ,b x , ■ ■ ■ ,b m ). By 
(ax, ■ • ■ , a n ) \ (bx, • ■ ■ , b m ) and (ax, ■ ■ ■ , a n ) fl (bx, ■ ■ ■ , b m ) we denote the tuples we obtain when we 
remove all elements from (ax, ■ ■ ■ , a n ) that (do not) belong to (bx, ■ ■ • , b m ). We write (ax, ■ ■ ■ , a n ) C 
(bx, ■ ■ ■ , b m ) if a/c G (&i,..., b m ) for any index k G [ri]. Let x be a vector and let A be a matrix. 
By as|ii...» n = a^ri]., » n ) we denote the column vector (a?j 1( ... ,X{ n ), by A\u lt __ ^ n \ we denote the 
matrix consisting of the rows ix, ■ ■ . ,i n of matrix A in this order. 

For an index set / C [n] and a vector y G {0, 1}" let Si(y) denote the set of all solutions z G S 
such that Zi — yi for any index i G /. For the sake of simplicity we also use the notation Si(y) to 
describe the set {z G S : Zi = {ji} for a vector y G {0, l}' 7 ' when the components of y are labeled 
by Vh,- ■ -,yi m where / = (ix,. . .,i\i\). 

With I„ we refer to the n x n-diagonal matrix diag(l, . . . , 1) and with O mX n to the m x n-matrix 
whose entries are all zero. If the number of rows and columns are clear, then we drop the indices. 
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3 Outline of our Approach 



To prove our results we adapt and improve methods from the previous analyses by Moitra and 
O'Donnell [T3J and by Roglin and Teng Q7] and combine them in a novel way. Since all coefficients 
of the linear objective functions lie in the interval [—1, 1], for every solution x G S the vector V 1 " x 
lies in the hypercube [—n, n] d . The first step is to partition this hypercube into e-boxes. If e is very 
small (exponentially small in n), then it is unlikely that there are two different solutions x G S 
and y G S that lie in the same e-box B unless x and y differ only in positions that are not perturbed 
in any of the objective functions, in which case we consider them as the same solution. In the 
remainder of this section we assume that no two solutions lie in the same e-box. Then, in order 
to bound the number of Pareto-optimal solutions, it suffices to count the number of non-empty 
e-boxes. 

In order to prove Theorem [3J we show that for any fixed e-box the probability that it contains 
a Pareto-optimal solution is bounded by Kn d 4> d e d for a sufficiently large constant k. This implies 
the theorem as the number of e-boxes is (2n/e) d . Fix an arbitrary e-box B. In the following we 
will call a solution x G S a candidate if there is a realization of V such that x is Pareto-optimal 
and lies in B. If there was only a single candidate x G S, then we could bound the probability 
that there is a Pareto-optimal solution in B by the probability that this particular solution x lies 
in B. This probability can easily be bounded from above by e d (j) d . However, in principle, every 
solution x6S can be a candidate and a union bound over all of them leads to a factor of \S\ in 
the bound, which we have to avoid. 

Following ideas of Moitra and O'Donnell, we divide the draw of the random matrix V into 
two steps. In the first step some information about V is revealed that suffices to limit the set of 
candidates to a single solution x G S. The exact position V 1 " d x of this solution is determined in 
the second step. If the information that is revealed in the two steps is chosen carefully, then there 
is enough randomness left in the second step to bound the probability that x lies in the e-box B. In 
Moitra and O'Donnell's analysis the coefficients in the matrix V are partitioned into two groups. 
In the first step the first group of coefficients is drawn, which suffices to determine the unique 
candidate x, and in the second step the remaining coefficients are drawn, which suffices to bound 
the probability that x lies in B. The second part consists essentially of d(d + l)/2 coefficients, 
which causes the factor of <^ d ( d+1 )/ 2 in their bound. 

We improve the analysis by a different choice of how to break the draw of V into two parts. 
As in the previous analysis, most coefficients are drawn in the first step. Only d 2 coefficients 
of V are drawn in the second step. However, these coefficients are not left completely random as 
in [13] because after the other coefficients have been drawn there can still be multiple candidates 
for Pareto-optimal solutions in B. Instead, the randomness is reduced further by drawing d(d — I) 
linear combinations of these random variables in the first step. These linear combinations have the 
property that, after they have been drawn, there is a unique candidate x whose position can be 
described by d linear combinations that are linearly independent of the linear combinations already 
drawn in the first step. In |f 7] it was observed that linearly independent linear combinations of 
independent random variables behave in some respect similar to independent random variables. 
With this insight one can argue that in the second step there is still enough randomness to bound 
the probability that x lies in B. While the analysis in [T7] yields only a bound proportional to 4> d e d , 
we prove an improved result for quasiconcave densities that yields the desired bound proportional 
to c/) d e d . 

For analyzing higher moments, it does not suffice to bound the probability that a fixed e-box 
contains a Pareto-optimal solution. Instead, in order to bound the c th moment, we sum over all 
c-tuples (B\, . . . , B c ) of e-boxes the probability that all e-boxes B\, . . . ,B C contain simultaneously 
a Pareto-optimal solution. We bound this probability from above by K,n cd (jf d e cd for a sufficiently 
large constant k. Since there are (2n/e) cd different c-tuples of e-boxes, this implies the bound 
of 0{{n 2 <j)) cd ) for the smoothed c th moment of the number of Pareto-optimal solutions. 
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Let us fix a c-tuple (B\, . . . , B c ) of e-boxes. The approach to bound the probability that all 
of these e-boxes contain simultaneously a Pareto-optimal solution is similar to the approach for 
the first moment. We divide the draw of V into two steps. In the first step enough information 
is revealed to identify for each of the e-boxes Bi a unique candidate Xi £ S for a Pareto-optimal 
solution in Bi. If we do this carefully, then there is enough randomness left in the second step to 
bound the probability that V 1 " A Xi £ Bi for every i £ [c]. Again most coefficients are drawn in the 
first step and some linear combinations of the other cd 2 coefficients are also drawn in the first step. 
However, we cannot simply repeat the construction for the first moment independently c times 
because then there might be dependencies between the events V 1 " x^ £ Bi for different i. In order 
to bound the probability that in the second step all Xi lie in their corresponding e-boxes Bi, we 
need to ensure that the events V 1 " Xi £ Bi are (almost) independent after the information from 
the first step has been revealed. 

The general approach to handle zero-preserving perturbations is closely related to the approach 
for bounding the first moment for non-zero-preserving perturbations. However, additional com- 
plications have to be handled. The main problem is that we cannot easily guarantee anymore 
that the linear combinations in the second step are linearly independent of the linear combinations 
revealed in the first step. Essentially, the revealed linear combinations describe the positions of 
some auxiliary solutions. For non-zero-preserving perturbations revealing this information is not 
critical as no solution has in any objective function exactly the same value as x. For zero-preserving 
solutions it can, however, happen that the auxiliary solutions take exactly the same value as x 
in one of the objective functions. Then there is not enough randomness left in the second step 
anymore to bound the probability that x lies in this objective in the e-interval described by the 
e-box B. 

In the remainder of this section we will present some more details on our analysis. We first 
present a simplified argument to bound the smoothed number of Pareto-optimal solutions. Af- 
terwards we will briefly discuss which changes to this argument are necessary to bound higher 
moments and to analyze zero-preserving perturbations. 

Smoothed Number of Pareto-optimal Solutions As an important building block in the 
proof of Theorem[3]we use an insight from [13] how it can be tested whether a given e-box contains 
a Pareto-optimal solution. Let us fix an e-box B = (pi, b\ + e] x ... x (&<$,&<$ + £]. The following 
algorithm takes as parameters the matrix V and the e-box B and it returns a solution x^°K 

Witness(V, B) 
1: Set K d+1 = S. 
2: for t = d, d — 1, . . . , do 
3: Set C t = {z£ Ht+i ■ V 1 - t z<b\ 1 ... t }. 
4: Set x (t) = argmin {V t+1 z : z £C t ). 
5: Set Kt = {z £ K t+1 : V t+1 z < V t+1 x^}. 
6: end for 
7: return x^ 

The actual Witness function that we use in the proof of Theorem [3] is more complex because 
it has to deal with some technicalities. In particular, the case that some set C t is empty has 
to be handled. For the purpose of illustration we ignore these technicalities here and assume 
that Ct is never empty. The crucial oberservation that has been made by Moitra and O'Donnell 
is that if there is a Pareto-optimal solution x £ S that lies in B, then x^ ' = x (assuming that no 
two solutions lie in the same e-box). Hence, the solution x^ returned by the Witness function 
is the only candidate for a Pareto-optimal solution in B. While this statement and the following 
reasoning are true for any d £ N, we recommend the reader to think first only about the illustrative 
case d = 2, in which there are one adversarial and two linear objective functions. 
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Our goal is to execute the Witness function and to obtain the solution x^ without revealing 
the entire matrix V. We will see that it is indeed possible to divide the draw of V into two steps 
such that in the first step enough information is revealed to execute the Witness function and such 
that in the second step there is still enough radnomness left to bound the probability that x^ lies 
in B. For this let I C [n] be a set of indices and assume that we know in advance which values 
the solutions x^ , . . . , x^ d ' take at these indices, i.e., assume that we know — x^ \i, . . . , = 
x^\i before executing the Witness function. Then we can reconstruct x^°\ . . . , x^ d > without 
having to reveal the entire matrix V. This can be done by the following algorithm, which gets as 
additional parameters the set / and the matrix A = (a™, . . . , u"). 

Witness(F, I, A, B) 

i: Sct^ d+1 =5nUto5/(« (f,) ). 

2: for t = d, d - 1, . . . ,0 do 

3: Set C t = {z e llt+i n 5 7 (oW) : V 1 -^ < fc|i... t }. 
4: Set = argmin {V t+1 z : z eC t }. 

5: Set R f = {ze K t+1 nU^o^zCo^) : V t+1 z < V t+1 x^}. 
6: end for 

7: return (x^°\ . . . ,x^) 

The additional restriction of the set IZd+i does not change the outcome of the Witness function 
as all solutions x^°\ . . . , x^ generated by the first Witness function are contained in the set TZd+i 
defined in line [1] of the second Witness function. Similarly one can argue that the additional 
restrictions in lines [3] and [5] do not change the outcome of the algorithm because all solutions x^ 
generated by the first Witness function satisfy the restrictions that are made in the second Witness 
function. Hence, if — x^ |/, . . . , = x^\i, then both Witness functions generate the 
same x^°\ 

We will now discuss how much information about V needs to be revealed in order to execute the 
second Witness function, assuming that the additional parameters / and A are given. We assume 
that the coefficients V? are revealed for every t 6 [d] and i £ I. For the remaining coefficients only 
certain linear combinations need to be known in order to be able to execute the Witness function. 
By carefully looking at the Witness function, one can deduce that for t G [d] only the following 
linear combinations need to be known: 

V t \ I -xl%,... t V t \ I .xl%, 
V*|j • (x^ - x W)\i, . . . , • (x^ - x^)\j . 

These terms can be viewed as linear combinations of the random variables Vf , t e [d], i G /, with 
coefficients from {— 1,0, +1}. In addition to the already fixed random variables V*, t e [d], i ^ I 
the following d linear combinations determine the position V 1 ' d x of x — x^: 

V I 1 -x<-%,...,V J d -x<-%. 

An important observation on which our analysis is based is that if the vectors x^\i, . . . , x^\i 
are linearly independent, then also all of the above mentioned linear combinations are linearly 
independent. In particular, the d linear combinations that determine the position of x cannot 
be expressed by the other linear combinations. Usually, however, it is not possible to find a 
subset I C [n] of indices such that the vectors xW\i,...,xl% are linearly independent. By 
certain technical modifications of the Witness function we will ensure that there always exists 
such a set I with |7| < d + 1. Since we do not know the set I and the matrix A in advance, we 
apply a union bound over all valid choices for these parameters, which yields to a factor of 0(n d ) 
in the bound for the probability that there exists a Pareto-optimal solution in B. 

Roglin and Teng [17] observed that the linear independence of the linear combinations implies 
that even if the linear combinations needed to execute the Witness function are revealed in the 
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first step, there is still enough randomness in the second step to prove an upper bound on the 
probability that V 1 '" d x lies in a fixed £-box B that is proportional to e d . The bound proven 
in [T7] is, however, not strong enough to improve Moitra and O'Donnell's result [T3] because it is 
in the order of ®(e d <fi d ), which would yield a bound of 0(n 2d cj) d ) for the smoothed number of 
Pareto-optimal solutions. We show that for quasiconcave density functions the bound in [17] can 
be improved significantly to <d(e d <fi d ), which yields the improved bound of 0(n 2d (f> d ) in Theorem[3] 

Higher Moments The analysis of higher moments is based on running the Witness function 
multiple times. As described above, we bound for a fixed c-tuple (Bi,...,B c ) of e-boxes the 
probability that all of them contain a Pareto-optimal solution. For this, we run the Witness 
function c times. This way, we get for every j £ [c] a sequence x^' ' , . . . , x^ ,d ^ of solutions such 
that x^'^ is the unique candidate for a Pareto-optimal solution in Bj. 

As above, we would like to execute the c calls of the Witness function without having to reveal 
the entire matrix V. Again if we know for a subset / C [n] the values that the solutions i"''', 
j g [c], t g [d], take at these positions, then we do not need to reveal the coefficients V/ with i € I 
to be able to execute the calls of the Witness function. As in the case of the first moment, it 
suffices to reveal some linear combinations of these coefficients. 

In order to guarantee that these linear combinations are linearly independent of the linear 
combinations that determine the positions of the solutions x^ ,0 \ j g [c], we need to coordinate the 
calls of the Witness function. Otherwise it might happen, for example, that the linear combinations 
revealed for executing the first call of the witness function determine already the position of x^ 2: °\ 
the candidate for a Pareto-optimal solution in B^. Assume that the first call of the Witness function 
returns a sequence x^ 1 ^ , . . . , x^' d ' of solutions and that I\ C [n] is a set of indices that satisfies the 
desired property that a^ 1,0 )^, . . . ,x^ x,<i '\i x are linearly independent. In order to achieve that all 
solutions generated in the following calls of the Witness function are linearly independent of these 
linear combinations, we do not start a second independent call of the Witness function, but we 
restrict the set of feasible solutions first. Instead of choosing x^- 2 '°\ . . . ,x^ 2,d ^ among all solutions 
from S, we restrict the set of feasible solutions for the second call of the Witness function to 
S' = Si 1 (x). Essentially, all solutions generated in calls of the Witness function have to coincide 
with x in all positions that have been selected in one of the previous calls. 

This and some additional tricks allow us to ensure that in the end there is a set / C [n] with 
|/| < (d + l)c such that all vectors x^'^\i, j g [c], t £ [d] are linearly independent. Then we can 
again use the bound proven in [17] to bound the probability that V 1 '" d x^'°" > g Bj simultaneously 
for every j 6 [c] from above by a term proportional to e cd (f) cd . With our improved bound for 
quasiconcave density functions, we obtain a bound proportional to e cd (f> cd . Together with a union 
bound over all valid choices for / and the values x^^\i, j £ [c], t £ [d], we obtain a bound 
of nn cd e cd (j) cd on the probability that all candidates x^'°> lie in their corresponding e-boxes for 
a sufficiently large constant n. Together with the bound of 0{n cd / e cd ) for the number of c- 
tuples (Bi, . . . , B c ) this implies Theorem [4] 

Zero-preserving Perturbations If we use the same Witness function as above, then it can 
happen that there is a Pareto-optimal solution x in the £-box B that does not coincide with the 
solution x(°) returned by the Witness function. This problem occurs, for example, if V d ■ x^ -1 ' = 
yd . x (o) t which 

we cannot exclude if we allow zero-preserving perturbations. We recommend to 
visualize this case for d = 2. On the other hand if we knew in advance that V d ■ x^ -1 ' — V d ■ x^°\ 
then we could bound the probability of V d x^ £ (bd, dd + e] already after the solution x^" 1 ) has 
been generated. Hence, if we were only interested in bounding this probability, we could terminate 
the Witness function already after has been generated. Instead of terminating the Witness 

function at this point entirely, we keep in mind that V d x^ has already been determined and we 
restart the Witness function with the remaining objective functions only. 
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Let us make this a bit more precise. As long as the solutions generated by the Witness 
function differ in all objective functions from x, we execute the Witness function without any 
modification. Only if a solution iW is generated that agrees with x in some objective functions, 
we deviate from the original Witness function. Let K C [<j] denote the objective functions in 
which &w coincides with x. Then we can bound at this point the probability that V t -x € (bt, h+e] 
simultaneously for all t G K. In order to also deal with the other objectives t ^ K, we restart 
the Witness function. In this restart, we ignore all objective functions in K and we execute 
the Witness function as if only objectives t ^ K were present. Additionally we restrict in the 
restart the set of feasible solutions to those that coincide in the objectives t G K with x, i.e., 
to {y G S | Vi G K : V 1 ■ y — V 1 ■ x}. With similar techniques as in the analysis of higher moments 
we ensure that different restarts lead to linearly independent linear combinations. 

This exploits that every Pareto-optimal solution x is also Pareto-optimal with respect to only 
the objective functions V t with t £ K if the set S is restricted to solutions that agree with x in all 
objective functions V 1 with t G K. This property guarantees that whenever the Witness function 
is restarted, x is still a Pareto-optimal solution with respect to the restricted solution set and the 
remaining objective functions. 

It can happen that we have to restart the Witness function d times before a unique candidate 
for a Pareto-optimal solution in B is identified. As in each of these restarts at most d solutions 
are generated, the total number of solutions that is generated can increase from d + 1, as in the 
case of non-zero-preserving perturbations, to roughly d 2 . The set / C [n] of indices restricted to 
which these solutions are linearly independent has a cardinality of at most d 3 . The reason for this 
increase is that we have to choose more indices to obtain linear independence due to the fixed zeros. 
Taking a union bound over all valid choices of /, of the values that the generated solutions take at 
these positions, and the possibilities when and due to which objectives the restarts occur, yields 
TheoremQ] This theorem relies again on the result about linearly independent linear combinations 
of independent random variables from |17j and its improved version for quasiconcave densities that 
we show in this paper. 

4 Non-zero-preserving Perturbations 

4.1 Smoothed Number of Pareto-optimal Solutions 

To prove Theorem[3]we assume without loss of generality that n > d+ 1 and consider the following 
function called the Witness function. It is very similar to the one suggested by Moitra and 
O'Donnell, but with an additional parameter /. This parameter is a tuple of forbidden indices: it 
restricts the set of indices we are allowed to choose from. For the analysis of the smoothed number 
of Pareto-optimal solutions we will set / = (). The parameter becomes important in the next 
section when we analyze higher moments. 

Witness(V, x, I) 
1: Set K d+1 =SnS I (x). 
2: for t = d, d— 1, . . .,0 do 
3: Set C t = {z G Kt+i : V x " A z < V 1 - t x). 
4: if Ct ^ then 

5: Set x w = argmin{^* +1 z : z G C t ). 
6: if t = then return x^ 

7: Determine the first index i t G [n] such that xf' ^ Xi t . 

8: /•HJU(ij) 

9: Set TZ t = {ze Kt +1 r\Si(x) : V t+1 z < V t+1 x^}. 

10: else 

11: Set i t = min([n] \ I). 
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12: 



7fn/U (it). 



13: 



Set xf = 



otherwise . 



14: Set TZ t = TZ t +i C\Si{x). 
15: end if 
16: end for 

17: return (_L, . . . , _L). 

Note, that Co = TZ\ since V 1 "' t z < V 1 '" t x is no restriction if t = 0. For t > 1 the index i t 
in line [7] exists because V 1 x^ < V x x, which implies x^' ^ x. In the remainder of this section 
we only consider the case that x is Pareto-optimal, that I contains pairwise distinct indices and 
that the number |J| of indices contained in / is at most n — (d + 1). This ensures that the 
indices io, . . . , i& exist. Unless stated otherwise, we assume that the OK-event occurs. That means 
that \V k • (y — z)\ > e for every k G [d] and for any two distinct vectors y ^ z G S and that 
for any k G [d] there is an index i G [n] such that \V^\ < 1. The first property ensures, amongst 
others, that there is a unique argmin in line[S] The latter property, which holds with probability 1, 
ensures that By{x) S B e for any vector x G { — 1,0, 1}™. 

Lemma 6. The call Witness(V, x, I) returns the vector x' -* = x. 

Proof. Like Moitra and O'Donnell, we prove two claims to show Lemma [5] 

Claim 1. There is no t £ [d + 1] such that there exists a solution z £ TZ t that dominates x with 
respect to the objectives V , . . . , V 4 . 

Claim 2. For any t G [d + 1] the solution x is an element oflZt- 

If both claims hold, then the correctness of Lemma H2 follows immediately: Claim [5] yields 
x G Co = TZ\ and, hence, we return a;*- ) = argmin {V x z : z G Ci} defined in line [SJ Due to 
Claim [0 = x. 

Let us first consider Claim [TJ For t = d + 1 the statement follows from the Pareto-optimality 
of x. Thus, let t < d and assume for contradiction that there is a vector z G IZt C 7?. t+1 that 
dominates a: with respect to V 1 -- 1 . Then, z ^ x and consequently V 1 --- t z < V 1 '" t x due to the 
occurrence of the OK-event. Hence, we obtain z G Ct and F t+1 a; ( -*- ) < due to the choice 

of i« in line[SJ This contradicts the inequality V t+1 z < V t+1 x^ which holds since z G IZt and 
due to the construction of IZt in line [S] 

To prove Claim [5] we use downward induction over t. For t = d + 1 the claim trivially holds 
as x G 5 and as x G <->/(&). Now, let t < d. If Ct — 0, then x € IZt since x G 72-t+i by the 
induction hypothesis and since x G Si{x). If Ct ^ 0, then assume for contradiction that x ^ IZt- 
Since x G 7^ f+ i fl Sj{x) due to the induction hypothesis, this yields V t+1 x^ < V t+1 x which 
contradicts Claim [1] as it implies together with x^' G TZt+i that x^ dominates x with respect to 
the objectives V 1 ,...,V t+1 . □ 

We will see that for the execution of the call Witness (V, x, J) we only need some information 
about x. This is one crucial property which will help us to bound the expected number of Pareto- 
optimal solutions. 

Definition 7. Let x^°\ . . . , x^ be the vectors constructed during the call Witness(V, x, /), let I 
be the tuple I at the moment when the Witness function terminates and let i* = min ([n] \ JJ . We 
call the pair (I*, A*) where I* = I U (f) and A* = [x (rf) , . . . , x^] the (V, J)-certificate of x. The 
pair (I*, A* |j. ) is called the restricted (V, 7)-certificate of x. We call a pair (/', A 1 ) a (restricted) I- 
certificate, if there exist a realization V such that OK(U) is true and a Pareto-optimal solution x G 
iS such that (I', A') is the (restricted) (V, I) -certificate of x. By ^(1) we denote the set of all 
restricted I -certificates. 
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For the analysis of the first moment we only need restricted /-certificates. Our analysis of 
higher moments requires more knowledge about the vectors x^ than just the values Xi for i £ I*. 
The additional indices are, however, depending on further calls of the Witness function which we 
do not know a priori. This is why we have to define two types of certificates. For the sake of 
reusability we formulate some statements more general than necessary for this section. 

Lemma 8. Let V be an arbitrary realization where OK(V) is true, let x be a Pareto- optimal 
solution with respect to V, and let (I*, A) be the restricted (V, I) -certificate of x. Then, I* = 
(Ji, . . . ,j\n+d+i) consists of pairwise distinct indices and 



A 



' "31 



"J|J|+1 



"3\i\ + i 



G{0,1} 



(|/|+d+l)x(d+l) 



where each can either be or 1 independently of the other '*'- entries. 

Proof. Lemma [6] implies that the last column of A equals Hence, we just have to consider 

the first d columns of A. Note, that I = (j\, . . . ,j\i\) and Jm+i, • • ■ , j\i\+d+i = id, ■ ■ ■ , ii, £*• Let 
It — I U (id, . . . ,it). The construction of the sets IZt yields !Z t Q <Sj t (x) (see lines [TJ |H1 and IT4")) . 
Index i t is always chosen such that it £ h+i- If it is constructed in line [71 then xf^ ^ Xi t . Since 
in that case we have x^' 6 TZt+i Q Si t+1 (x) and, hence, index i t cannot be an element of It+i- In 
line llll index i t is explicitely constructed such that it ^ h+i- The same argument holds for the 
index i*. Hence, the indices of I* are pairwise distinct. 

Now, consider the column of A corresponding to vector x^> for t € [d]. If Ct — 0, then the form 
of the column follows directly from the construction of x^' in line Q2] and from the fact that the 
indices of /* are pairwise distinct. If C t ^ 0, then x^ G C t C 7Z t +i C Si t+1 (x), i.e 
with x in all indices i £ It+i- By the choice of it in line [7] we get x 
proof. 



(*) _ 



x^ agrees 



This concludes the 
□ 



Let (I*, A*) be the (V, 7)-certificate of x and let J D /* be a tuple of pairwise distinct indices. 
We consider the following variant of the Witness function that uses information about x given by 
the index tuple J, the bit matrix A = A*\j with columns a^ d \ . . . , , a shift vector u and the 
e-box B = By (x — u) instead of vector x itself. The meaning of the shift vector will become clear 
when we analyze the probability of certain events. We will see that not all information about V 
needs to be revealed to execute the new Witness function, i.e., we have some randomness left 
which we can use later. With the choice of the shift vector we can control which information needs 
to be revealed for executing the Witness function. 

Witness(V, J, A, B, u) 
1 
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Let b be the corner of B. 

Set K w = 5nUL5j(« (i,) ). 
for t = d, d — 1,...,0 do 

Set Ct = {z£Ht + inSj(aW) 
if C t ^ then 

Set x^ 1 = argmin {V* +1 z : z 
if t = then return a;^ 
Set lit = {z e Kt+i n\J t 72 Sj(a^) : V t+1 z < V t+1 x^}. 
else 

Set a;W = (_L,...,_L). 

Set TZ t =Kt+in\J t -2 S J (a^). 



V 1 - t -{z-u)<b\ 1 ... t }. 
eC t }. 
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12: end if 

13: end for 

14: return (_L, . . . , _L) 

Lemma 9. Let {I*, A*) be the (V, I) -certificate of x, let J D I* be an arbitrary tuple of pairwise 
distinct indices, let A = A*\j, let u G {0, 1}™ be an arbitrary vector, and let B — By(x — u). Then, 
the call Witness(V, J, A, B, u) returns vector x. 

Before we give a formal proof of Lemma [5] we try to give some intuition for it. Instead of 
considering the whole set S of solutions we restrict it to vectors that look like the vectors we want 
to reconstruct in the next rounds, i.e., we intersect the current set with the set \JI~-q S j (^a^ •*) in 
round t. That way we only deal with subsets of the original sets, but we do not lose the vectors 
we want to reconstruct since J D I*. 

Proof. Let lZ' t , C' t , and x 1 ^' denote the sets and vectors constructed during the execution of the 
call Witness(V, J, A, B, u) and let TZt, Ct, and x^' denote the sets and vectors constructed during 
the execution of call Witness(V, x, I). By induction we prove the following statements: 

• K C Tl t for any t G [d+1]. 

• x'W = x (t) for any t G [d] for which C t ^ 0. 

• x^") £K' t for any t £ [d+1] and any t' G [t - 1] for which C v ^ 0. 

With those claims Lemma [5] follows immediately: Since x^ = x and Co ^ due to Lemma [5] we 
obtain x^ G 7Z[ — C' and, hence, we return x'^ = x^ = x. 

Let us first focus on the shift vector u and compare line [3] of the first Witness function with 
line [4] of the second Witness function. The main difference is that in the first version we have the 
restriction V 1 '" t z < V 1 '" t x, whereas in the second version we seek for solutions z such that V 1 '"* ■ 
(z — u) < b\i...t- As b is the corner of the £-box B — By(x — u), those restrictions are equivalent 
for solutions z G S since 

V x - A -(z-u) < b\x...t V 1 "* -(z-u) < V 1 - 1 -{x-u)^ V 1 - t z < V 1 '" t x . 

The first inequality is due to the occurrence of the OK-event. Now, we prove the statements by 
downward induction over t. Let t = d+1. Lemma [5] yields )| = x\i for any t' G [d]o, i.e., 

Ut'=o >^j( a ^'' ) ) £ Si(x). Consequently, 7l' d+1 C TZd+i- Consider an arbitrary index t' G [d] for 
which Ct, ^ 0. Then, x^ G 7l t - C S and x^ G Sj(a^). Hence, x^ G TZ' d+v 

Let t < d. By the observation above we have C[ = {z G TZ' t+1 n<Sj(a (t) ) : V 1 -^ < V 1 - t x) 
and Ct — {z G TZt+i ■ V 1 --- t z < V 1 - t x}. Since TZ' t+1 C IZt+i, we obtain C[ C C t - We first consider 
the case C t = 0, which implies C' t = 0. Then, VJ t = W t+1 n \j\T= «Sj(a (t ' } ) and 7l t = Ut+\ H 5/(x) 
for the current index tuple /. According to Lemma [8j all vectors x^°\ . . . , x^ -1 ^ agree with x 
on the indices i G I. Thus, Ut^o^jC ^ "0 — ^i( x )> an d as ^t+i £ ^t+i we obtain C 7£t. 
Let t' G [t — 1] be an index for which Cf / 0. Then, x^ > G TZ' t+1 by the induction hypothesis, 
x^ > G Sj(a^ and consequently x^ > £TZ' t . 

Finally, let us consider the case C t ^ 0. The induction hypothesis yields x^> G H' t+l . 
Since x® G Sj(a,W) and V 1 - 4 ■ < y 1 -*^, also iW G and thus ^ 0. Hence, x'W = a;W 
as Cj C Ct- The remaining claims have to be validated only if t > 1. Then, 1Z' t = {z G 

nUr=o 5 ./( a(t,) ) : < ^* +1 ^ (t) } and ft t - {z G %nSj(j) : F t+1 z < 

for the current index tuple /. With the same argument used for the case Ct — we obtain 
"R-'t+i n Ut'^o ^( a ^* ') — T^t+i H 5/(a;) and thus 7?.^ C TZ t - Consider an arbitrary index t' G [t — 1] 
for which C t , ^ 0. Then, G 7^ t ' C 7^. In particular, < V t+1 x (t \ Furthermore, 

x^ ' G Ti-'t+i due to the induction hypothesis, and x^ ' G Sj(a^ Consequently, x^ ' G lZ' t . □ 
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As mentioned earlier, with the shift vector u we control which information of V has to be 
revealed to execute the call Witness(V, J, A, B, u). While Lemma [9J holds for any vector u, we 
have to choose u carefully for our probabilistic analysis to work. We will see that the choice 
u* = u*{J,A), 



i = l , 
i g J\(<*) 
otherwise , 



(1) 



is appropriate. Note, that i* is the index that has been added to / in the definition of the (V, I)- 
certificate to obtain I*. Moreover, the values Xi are encoded in the last column of A for any 
index i 6 J according to Lemma[5J Hence, if (/*, A*) is the (V, J)-certificate of x, then vector u* 
can be defined when a tuple J D I* and the matrix A = A* \ j are known; we do not have to know 
the vector x itself. 

For bounding the number of Pareto-optimal solutions consider the functions xi*,a,b{V) pa- 
rameterized by an arbitrary restricted /-certificate (/*, A), and an arbitrary e-box Bel £ , defined 
as follows: XI*,A,b(V) = 1 if the call Witness (V, I*, A, B, u*{ J, A)) returns a solution x' G S such 
that B v (x' - u*(I*,A)) = B, and xi*,a,b{V) = otherwise. 

Corollary 10. Assume that OK(V) is true. Then, the number PO(V) of Pareto-optimal solutions 
is at most E(/«,A)e^(/) Esgi e Xi-,a,b{V). 

Proof. Let x be a Pareto-optimal solution, let (I*, A) be the restricted (V, I)-certificate of x, 
and let B = B v (x - u*(I*,A)) G B e . Due to LemmaEE Witness(F, I*, A, B, u*(I*,A)) returns 
vector x. Hence, Xi*,A,b(V) = 1. It remains to show that the function x M> (I*, A, B') is injective. 
Let Xi and £2 be distinct Pareto-optimal solutions and let (I^,Ai) and (/|, A2) be the restricted 
(V, /)-certificates of x\ and X2, respectively. If (J*,Ai) 7^ (Jj, A2), then x\ and X2 are mapped to 
distinct triplets. Otherwise, u*(I*,Ai) = u*(I%,A 2 ) and, hence, B v (x\ - u*(I*,Ax)) ^ B v (x 2 - 
u*{IZ)A2)) because OK(V) is true. Consequently, also in this case x\ and x-i are mapped to 
distinct triplets. □ 

Corollary [TU] immediately implies a bound on the expected number of Pareto-optimal solutions. 

Corollary 11. The expected number of Pareto-optimal solutions is bounded by 



Ey [PO(V)\ < E Prv \- E I'^b] + 2" • Pr y [OK(V) 

where Ei* y A,B denotes the event that the call Witness(V, I*, A, B, u*{I*, A)) returns a vector x' 
such that B v (x' - u*(I*,A)) = B. 



Proof. Using Corollary [TU] we obtain 



Ey [PO(V)} = Ey [PO00 I OK(V)] • Pry [OK(V)] + Ey PO(V) I OK(V) 



Pri 



OK(V) 



< Ei 



< E, 



E E Xi*,a,b(V) I OK(V) 



Pry [OK(V)] + 2™ • Pry OK(V) 



E 52xi;a,b(V) +2" -Pry OK(V) 

(J*, A) B£K E 

= E E Pr ^ 1 E i;a,b] + 2" • Pry \OK(V) 



□ 
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We will see that the first term of the sum in Corollary QT] can be bounded independently of e 
and that the limit of the second term is for e — > 0. First of all, we analyze the size of the restricted 
certificate space. 

Lemma 12. The size of the restricted certificate space ^(Jo) for Iq = () is bounded by \ c /o(Io)\ < 
2 (d+i) 2 . n d = ( n d} j or fi xed d 

Proof. Given an index tuple Jo, exactly d indices ii, ■ ■ ■ ,id are created in Witness(V, x, Jo) if OK(V) 
is true and if x is Pareto-optimal with respect to V . The index i* is determined deterministicly 
depending on the indices ii,...,id- Matrix A of any restricted Jo-certificate (J*, A) is a binary 
(d + 1) x (d + l)-matrix. Hence, the number of possible restricted Jo-certificates is bounded 
by 2( d+1 ) 2 ■ n d . □ 

Let us now fix an arbitrary J-certificate (I*, A*), a tuple J D I*, and an e-box B 6 B E . We 
want to analyze the probability Pry [Ej.a.b] where A = A*\j. By Vj and V-j we denote the part 
of the matrix V 1 " that belongs to the indices i e J and to the indices i £ J, respectively. For our 
analysis let us further assume that the Vj is fixed as well, i.e., we will only exploit the randomness 
ofVj. 

As motivated above, the call Witness(V, J, A, B, u) can be executed without the full knowledge 
of Vj. To formalize this, we introduce matrices Qk that describe the linear combinations of Vj we 
have to know: 

Q k = [p^ , . . . , p W , p(*-*) - p(fe-D , . . . , p (0) _ p (*-D] ( 2 ) 

for p'*' = p^(J, A, u) — —u\j where are the columns of matrix A. Note, that the matrices 
Qk = Qk(J, A, u) depend on the pair (J, A) and on the vector u. 

Lemma 13. Let u G {0,1}" be an arbitrary shift vector and let U and W be two realizations 
of V such that U-j = Wj and Uj ■ q = Wj ■ q for any index k G [d] and any column q of the 
matrix Qk(J, A, u). Then, the calls Witness(?7, J, A, B, u) and Witness(T4^ J, A, B, u) return the 
same result. 

Proof. We fix an index k G [d] and analyze which information of Vj is required in the call 
Witness(V, J, A, B, u). For the execution of line 2] we need to know V k ■ (z — u) for solutions z G 
<5j(o^) in all rounds t > k. Since we can assume Vj to be known, this means that Vj- (z\j—u\j) — 
Vj ■ (a^ — u\ j) = Vj • p^ must be revealed. For t > k vector p^ is a column of Qk- The 
execution of line |H] does not require further information about Vf as all solutions in Ct agree 
on the indices i G J. Hence, it remains to consider line [5] Only in round t = k — 1 we 
need information about V h . In that round it suffices to know Vj ■ (z\j — x^\j) for any so- 
lution z G Utio Sj( a ^ ■*)• Hence, for z G Sj(a^^), t' £ [k — 2]o, the linear combinations 
I/ 7 fc • (z\j - asW|j) = Vj ■ ((aC) - u|j) - (a^ 1 ) - u\j)) = V* ■ (pM ~ p^-^) must be re- 
vealed. Since t' € [k— 2]q, this is a column of Qk- As U and W agree on all necessary information, 
both calls return the same result. □ 

We will now see why u* — u*(J, A) defined in Equation ((TJ) is a good shift vector. 

Lemma 14. Let Q = [p (d) , . . . ,p (0) ] where = p W ( J, A, u*(J, A)) \ Jt . Then, 



\Q\ = 



... 1 * 
: : '•• 



: : ' ■ * 
0... 1 



T 



G{0,1} 



(|/|+d+l)x(d+l) 



where \Q\ denotes the matrix Q' for which q'^ = \qij\ 
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Proof. Let I* = . . . ,j\i\ +d +i), i-e., i* 
of vector u* in Equation ((1} we obtain 



: jm+d+i. According to Lemma |8] and the construction 



x 



3\l\ 



"3\i]+i 
* 



"J|ii+a 



c i|/i+i 



"3\i\+d 



' ji 



c i|/i+i 



c i|j|+d+i 



■"J|/|+d+l 

The claim follows since a — a = and a — a G { — 1, 1} for a G {0, 1}. 



"31 



■"3\l\ 
C 3\l\ + l 



^3\I\+d 
L J\l\+d + i 



□ 



Lemma 15. For any k G [d] £/ie columns of the matrix Qk(J, A,u* (J, A)) and i/ie vector p( ' are 
linearly independent. 

Proof. It suffices to show that the columns of the submatrix Qk = Qk\j, and the vector p^> = 
P^\j, are linearly independent. For this, we introduce the vectors p^ — p^\ Jt - Consider the 
matrix Q — [p^ , . . . , p^] . Due to Lemma [14] the last d + 1 rows of Q form a lower triangular 
matrix and the entries on the principal diagonal are from the set {—1, 1}. Consequently, the 
vectors are linearly independent. As these vectors are the same as the columns of matrix Q\ 
plus vector p(°>, the claim holds for k = 1. Now let k > 2. We consider an arbitrary linear 
combination of the columns of Qk and the vector p(°) that computes to zero: 





fc-2 

r£v(p«- 




t=k 


t=0 






fe-2 


/fe-2 


t=fc 




\t=o 



*(0) 



As the vectors p"> are linearly independent, we first get At = for t G [d] \ {k — 1}, which 
yields Ao = and, finally, n — 0. This concludes the proof. □ 

Corollary 16. For an arbitrary restricted I -certificate (/*, A) the probability of the event Ei* ,a,b 
is bounded by 

Pr v [E I ,^ B ]<2 d - 1 ''- d ^ d -e d 
if all densities are quasiconcave, where 7 = d ■ + d + 1). 

Proof. Event E]* a,b occurs if the output is a vector x' such that By(x! — «*(/*, A)) = B. In 
accordance with Lcmma [T3l thc output vector x' is determined when for any k G [d] the linear combi- 
nations of Vj, given by the columns of matrix Qk are known. The equality By (x' — u* (I* , A)) = B 
holds if and only if 

V k ■ (x' -u*(I*,A)) = V±- (a/-K*(/*,A))|- F +F J fe , • (x' - u*(I*,A))\ Im G B 

holds for any k G [d]. Since is determined and as (x' - u*(I*,A)) | = a^ ^ — u*(I*, A)\i* — p^ 
for the vector p(°) — p(°) (j* , A, u* (I* , A)) , this is equivalent to the event that 

Vj\ ■ E B — Vj^ ■ [x' — u*(I*,A)) y =: C fc , 

where Cfc is an interval of length e depending on 2/ and hence on the linear combinations of Vj* 
given by the matrices Qfe- We do not explicitly mention the dependence on V— as we consider it 
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to be fixed. By C we denote the d-dimensional hypercube C — Yik=i Ck with side length e defined 
by the intervals Ck- 

For any k £ [d] let Q' k be the matrix consisting of the columns of Qk and the vector p(°\ This 
matrices form the diagonal blocks of the matrix 



6{-i,o,ir 



d-\r\xd-(d+l) 



Lemma 1151 implies that matrix Q' has full rank. We permute the columns of Q' to obtain a 
matrix Q where the last d columns belong to the last column of one of the matrices Q' k . Let the 
rows of Q be labeled by Qj lt i,- ■ ■ , Qj m ,i, ■ ■ ■ , Qj x ,d, ■ ■ ■ , Qj m ,d assuming that I* =(ji,... ,j m ). We 
introduce random variables Xj t k — V^, j 6 I* , k € [d], labeled in the same fashion as the rows of Q. 
Event Ef a,b holds if and only if the d linear combinations of the variables Xj^ given by the last d 
columns of Q fall into the d-dimensional hypercube C depending on the linear combinations of the 
variables Xj } k given by the remaining columns of Q. The claim follows by applying Theorem 1351 
for the matrix Q T and due to the fact that the number of rows of Q is d ■ = 7. □ 

Proof of Theorem We begin the proof by showing that the OK-event is likely to happen. For 
any index t G [d] and any solutions x ^ y E S the probability that \V t x — V t y\ < e is bounded 
by 2(j)£. To see this, choose one index i € [n] such that xt ^ yt and fix all coefficients Vj for j ^ i. 
Then, the value V* must fall into an interval of length 2e. A union bound over all indices t S [d] 

and over all pairs (x,y) 6 S x S such that x y yields Pry 

With Iq = () and 7 = d ■ (d+ 1) we obtain 



OK(V) 



< 2 2n+1 dd>e. 



Ey [PO( V)] < Y, S Prv ^ El * ' A ' B ^ + 2 " ' Prv l 0K ( V ) 

[I* ,A)eV(i ) seB e 



< 



< 



™< .s d -e d + 2 3n+1 c 



(I*,A)^(I ) B& 



< ■ n d ■ d ■ 2 d ■ (d + l) 2d2 -4> d -e d + 2 3n+1 dcbe 

< 2^ 2 ■ (d + l) 2d2 ■ n 2d ■ d> d + 2 3n+l d<Pe . 



The first inequality is due to Corollary [TTJ The second inequality is due to Corollary [TOl The 
fourth inequality stems from Lemma |l 21 Since this bound is true for arbitrarily small e > 0, the 
correctness of Theorem [3] follows. □ 



4.2 Higher Moments 

The basic idea behind our analysis of higher moments is the following: If the OK-event occurs, 
then we can count the c th power of the number PO(V) of Pareto-optimal solutions by counting all 
c-tuples (Bi, . . . ,B C ) of e-boxes where each e-box Bi contains a Pareto-optimal solution Xi. We 
can bound this value as follows: First, call Witness (V, Xx, ()) to obtain a vector x[ and consider 
an index tuple I\ containing all indices created in this call and one additional index. In the second 
step, call Witness(V, X2, I\) to obtain a vector x' 2 and consider the tuple I2 consisting of the indices 
of I\, the indices created in this call, and one additional index. Now, call Witness(V, X3, 12) and 
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so on. If (x\, . . . ,x c ) is a tuple of Pareto-optimal solutions with V 1 af» G -Bi for i G [c], then 
(x' l5 . . . ,2^) = (xi, . . . ,x e ) due to Lemma [6] As in the analysis of the first moment, we use the 
variant of the Witness function that uses certificates of the vectors xe instead of the vectors itself 
to simulate the calls. Hence we can reuse several statements of Subsection 14.11 

Unless stated otherwise, let V be a realization such that OK(V) is true and fix arbitrary 
solutions xi, . . . ,x c G <S with V 1 " Xi G Bi for i G [c] that are Pareto-optimal with respect to V. 

Definition 17. Let Iq — () and let (ij, A|) be the (V, I^-x)' certificate of xi defined in Definition^ 
I = 1, . . . ,c. We call the pair {I*, A), where I* = I*, A = (Ax, . . . ,A C ) for A e = A* t \ , the V- 
certificate of {x\, . . . , x c ). We call a pair (I', A') a c-certificate, if there is a realization V such 
that OK(V) is true and if there are Pareto-optimal solutions Xi, . . . , x c G S such that (/', A') is 
the V -certificate of (x\, . . . , x c ). By ^ c we denote the set of all c- certificates. 

Note, that 1$ C . . . C I* and \I* \ = + d + 1 for I G [c]. 

We now consider the functions Xi* a parameterized by an arbitrary c-certificate (I*, A) G 

ff c and a vector B G B° of e-boxes and defined as follows: Xi* ^ _g(^0 = 1 if for any I G [c] the 
call Witness (V, I*, At, B e , u*(I*, Ae)) returns a solutions x' t such that B v (x' g — u* (I* , At)\ = Bt, 
and Xi* a B^) = otherwise. The vector u* — u*(I* , Ae) is the one defined in Equation[TJ 

Corollary 18. Assume that OK(V) is true. Then, the c th power of the number PO(V) of Pareto- 
optimal solutions is at most A)e€ SseB c A b(^0- 

Proof. The c th power of the number PO(V) of Pareto-optimal solutions equals the number of 
c-tuples (x\,...,x c ) of Pareto-optimal solutions. Let (xi,...,x c ) be such a c-tuple, let (I*, A) 
be the ^-certificate of (x\, . . . , x c ), an let Be = By(xe — u*(I*,Ae)) G B e . Due to Lemma El 
\l±tness(V, I* , A e , B e ,u*(I* , Ae)) returns vector x e for any £ G [c]. Hence, %i* a = 1 f° r 

£> = . . . , B c ). To conclude the proof we show that the function (sci, . . . , x c ) h-> (/*, A, £>) is 
injective. Let (xi,...,x c ) and (yi,...,y c ) be distinct c-tuples of Pareto-optimal solutions, i.e., 
there is an index I G [c] such that xe ^ y^, and let (I*,A^) and (J|, A^) be their ^-certificates. 
If (If, A (1) ) ^ A< 2 )), then both tuples are maped to distinct triplets. Otherwise, u*(I*,A^) = 

u*m,Af ] ) and thus B v (x e - u*{Il,A { p)) B v (y e - u*{L^,Af ] )) since OK(V) holds. Conse- 
quently, also in this case (x\, . . . , x c ) and (yi, . . . , y c ) are mapped to distinct triplets. □ 

Corollary 1181 immediately implies a bound on the c th moment of the number of Pareto-optimal 
solutions. 

Corollary 19. The c th moment of the number of Pareto-optimal solutions is bounded by 
Ey [PO c (V)] < E _E Pl > [ E i;A,s] + 2 " ' Pl > [O^O 7 )" 

where Ej, ? ^ denotes the event that Xi* 2 s(^0 = 1- 

We omit the proof since it is exactly the same as the one of Corollary [TT1 

Lemma 20. The size of the restricted certificate space is bounded by \%\ < 2 c2 < d+1 ^ -n cd = 0(n cd ) 
for fixed c and d. 

Proof. Let (I*, A) be an arbitrary c-certificate. Each matrix Ae is a |7*| x (d + l)-matrix. The 
tuple I* can be written as I* = (i^ , . . . , i^ 1 , . . . , v£ , . . . , i*^), created by c successive 
calls of the Witness function, where the indices i*^ are chosen deterministicly in Definition [7] 
Since \I* \ = c • (d+ 1) the claim follows. □ 
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Corollary 21. For an arbitrary restricted c-certificate (I*, A) and an arbitrary vector B G B^j of 
e-boxes the probability of the event Ej, r g is bounded by 



and by 



Pr, 



Pr, 



E 



I*,A,B 



<(27) 



7 — erf 



E 



I*,A,B 



< 2 cd . yr-cci 



icd cd 



if all densities are quasiconcave, where 7 = cd ■ (d + 1). 

Proof. For fc G [d] and ^ G [c] consider the matrices Qfe(l*,A^,u|) for u\ — u*(I*,Ai) defined in 
Equation ((2J- Due to Lemma [T3l the output of the call Witness (V, I*, Ag, Bg, u|) is determined 
if Vjt and the linear combinations Vk, ■ q for any index k G [d] and any column q of the ma- 
trix q¥' — Qk(I* 1 Ai,uf) are given. With the same argument as in the proof of Corollary [21] 
event Ej, occurs if and only if Vi* ■ \p^' x \ . . . ,p^ ,d ^] falls into some d-dimensional hyper- 
cube Cg with side length e depending on the linear combinations Vi* ■ Q% • In this notation, 
is short for p^ (I* , A e ,u}). 
Now, consider the matrix 



Q' k = \Q { k\p {1 ' k \ 



Due to Lemma 1141 this is a lower block triangular matrix, due to Lemma [15] the columns of 
[Q^ ,p^' k ^] are linearly independent. Hence, matrix Q' k is an invertible matrix and the same 
holds for the block diagonal matrix 



Q' 



Q'i 



Q'a 



G {-1,0,1} 



(d-\r\) 2 



We permute the columns of Q' to obtain a matrix Q where the last c ■ d columns belong to 



the columns p 



(1,1) 



(i,d) 



,(c,l) 



,p 



We assume the rows of Q to be labeled by 



,P K 

Qj lt i, Qj m ,i, • • • , Qji,d, • • • > Qj m ,d where I* = (Ji, ■ ■ ■ ,j m ) an d introduce random variables 
Xj,k = Vf i 3 € I*> k € [d], indexed the same way as the rows of Q. Event E r , ^g holds if 
and only if the cd linear combinations of the variables Xjk given by the last cd columns of Q 
fall into the cd-dimensional hypercube C — Y\i=i Ci with side length e depending on the linear 
combinations of the variables Xj^ given by the remaining columns of Q. The claim follows by 
applying Theorem [33] for the matrix Q T and due to the fact that the number of rows of Q is 
d-|/*|=cd-(d + l)=7. □ 

Proof of Theorem [^J In the proof of Theorem [3J we show that the probability that the OK-event 
does not hold is bounded by 2 2n+1 d(f>e. For 7 = cd ■ (d + 1), we set s = (2j) 7 ~ cd ■ (fP for general 
densities and s = 2 cd ■ y _cd • <f> cd in the case of quasiconcave density functions. Then, we obtain 



E 



I*,A,B 



E v [PO c (V)}< £ J2 Pr v 

< J2 s ■ e cd + 2 n ■ 2 2n+1 



2 n • Pr, 



OK(V) 



< 2 c2 -( d+1 ) 2 • n cd 




s-e cd + 2 3n+1 d(/)s 
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< 2 c2 < d+1 ^ +cd ■ n 2cd ■ s + 2 in+1 d4>e . 

The first inequality is due to Corollary [HO The second inequality is due to Corollary The 
third inequality stems from Lemma 1201 Since this bound is true for arbitrarily small e > 0, the 
correctness of Theorem 2] follows. □ 

The proof of Theorem [4] yields that 

Ey [PO c (V)] <s c = 4 c ' 2 ( d+1 ) 2 • (cd(d + l)) cd2 ■ n 2cd ■ 

where (3 — d(d + 1) in general and (3 = d for quasiconcave densities. With the following Corollary 
we bound the probability that PO(V) exceeds a certain multiple of sj.. We obtain a significantly 
better concentration bound than the one we would obtain by applying Markov's Inequality for the 
first moment. 

Corollary 22. The probability that the number of Pareto- optimal solutions is at least k ■ s\ is 
bounded by 

1 I log 8 fc 

/ 1 \ 2 Laid+i) 

Pry[PO(V) > fc-si] < 7 



Proof. We bound the probability as follows: 



Pry [PO(V) >k-sx} = Pry [PO c (y) > k c ■ s\] = Pr v 



P0C ^^E7Tp^k- E ^ P0C(F)] 



< Pry 



PO c (V) > — ii • Ey [PO c (V)] 



Sc 

< 



k c ■ si 



k c . ^(d+l) 2 . + ly^cd? . n 2cd . - f,c 



4 c(d+l) 2 . 4 d 2 log 4 c \ / gc(d+l) 2 



We obtain the claimed bound by setting c = [c*\ where c* is chosen such that 8 C = fc 1 / 2 , 

i.e., c* = (log 8 k)/{2{d+l) 2 ). ' □ 

5 Zero-preserving Perturbations 

To prove Theorem [T] we will first show that we can concentrate on a special class of instances. 

Lemma 23. Without loss of generality in each objective function except for the adversarial one 
there are more than f{d) perturbed coefficients, i.e., coefficients that are not deterministically set 
to zero, where f is an arbitrary fixed function. 

Proof. For k e [d] let P^ be the tuple of indices i such that is a perturbed coefficient. Let K 
be the tuple of indices k for which \Pk\ < f(d), let P = [J keK Pk, and consider the decomposition 

of S into subsets of solutions S v = {x G S : x\p = v}, v £ {0, Let x G S v be an arbitrary 
solution. If x is Pareto-optimal with respect to S and V 1 " +l , then x is also Pareto-optimal 
with respect to S v and the objective functions V k , k G [d + 1] \ K . As all remaining objective 
functions V k , k G [d + 1] \ K, have more than f(d) perturbed coefficients, the instance with 
these objective functions and S v as set of feasible solutions has the desired form. Since there are 
2\P\ < 2 d '^( d ) — 0(1) choices for v, the smoothed number of Pareto-optimal solutions for general 
(/>-smooth instances is only by a constant factor larger than the smoothed number of Pareto-optimal 
solutions for oi-smooth instances of the form mentioned in the lemma. □ 
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Lemma 24. Without loss of generality for every i £ [n] exactly one of the coefficients V^, . . . , V d 
is perturbed, whereas the others are deterministically set to zero. 

Proof. We first show how to decrease the number of indices i for which , . . . , V d is perturbed 
to at most one. For this, let S' = {(x,x,...,x) x £ S} C {0,l} d ™ be the set of feasible 
solutions that contains for every x £ S the solution x d £ {0, l} dn that consists of d copies of x. 
For k £ [d] we define a linear objective function W k : S' — > K in which all coefficients W k with i £ 
{(k — l)n + 1, . . . , fcn} are deterministically set to zero. The coefficients Wh_-f. n+l , . . . , W^ n are 
chosen as the coefficients V k , . . . ,V k , i.e., either randomly according to a density f k or zero 
deterministically. The objective function W d+1 maps every solution x'' £ 5' to The 
instance consisting of S' and the objective functions W 1 , . . . , W d+1 has the desired property that 
every binary variable appears in at most one of the objective functions W 1 , . . . , W d and it has 
the same smoothed number of Pareto-optimal solutions as the instance consisting of S and the 
objective functions V 1 , . . . , V d+1 . For every i £ [dn] for which none of the coefficients W/, . . . , Wf 
is perturbed we can eliminate the corresponding binary variable from S' . 

This shows that any </>-smooth instance with <S C {0,1}" can be transformed into another 
0-smooth instance with iS C {0, 1}^ with £ < dn in which every binary variable appears in exactly 
one objective function and that has the same smoothed number of Pareto-optimal solutions. As 
the bound proven in Theorem [T] depends polynomially on the number of binary variables, we lose 
only a constant factor by going from S C {0, 1}™ to S' C {0, l} dn . □ 

In the remainder of this chapter we focus on instances having the structure described in 
Lemma [23] and Lemma l24l Then, (Pi, . . . , P^) is a partition of [n], where Pt denotes the tu- 
ple of indices i for which V* is perturbed. This structure also ensures that By{x) £ B e holds for 
any vector {—1,0, 1}™. 

We consider the following variant of the Witness function, which gets as parameters besides 
the usual V and x, a set K C [d] of objective functions, a call number r £ N, and a set / C [n] of 
indices. In a call of the Witness function only the adversarial objective function V d+1 and the 
objective functions V 1 with t £ K are considered, and the solution set is restricted to solutions 
that agree with x in all positions Pk with k ^ K . Additionally, as in the Witness function for 
higher moments, only solutions are considered that agree with x in all positions i £ I. By the 
right choice of /, we can avoid choosing an index multiple times in different calls of the Witness 
function. The parameter r simply corresponds to the number of the current call of the Witness 
function. The Witness function always returns some subset of S. 

Witness (X / , x, K, r, I) 
l: Let K be of the form K = [k\, . . . , 
2: Set kd'+i = d + 1. 

3: Set =SnSj(x)n f) k e[d]\K Sp k (x). 

(r) 

4: if d — then return 7l d , +1 
5: for t = d',d' - 1, . . . ,0 do 

6: Set C t (r) = {z £ : V kl - kt z < V kl - kt x}. 

7: if C[ r) ^ then 

8: Set X[ r) = arg min { V kt+1 z : z £ c[ r) } . 

9: Let x^ r '^ be the canonically first vector in . 

10: Let i^EQ C K be the tuple of indices k such that x^ r,t ^\p k = x\p k . 

11: Let i^NEQ C K be the tuple of the remaining indices. 

12: for k £ K do 
13: if k £ K E q then 

14: Set rj; = r. 
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15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 

31 

32 
33 
34 
35 



else 

Determine the first index i G Pk such that xf ^ 
/M/U(i) 
end if 
end for 

if K E q = then 

Set TZ[ r) = {zc Kj'\nS/(a:) : < V^a:^}. 

else 

Set — t. 

return Witness (V, x, K^eq, r + 1, /) 
end if 
else 

for 6 K do 

Set «fe = min(P fe \ I). 
/<h/U(h). 
end for 

c (r,t) = f x7 : ie {i fcl ,...,i kd/ }, 
1 1 Xi : otherwise . 

,(r) _ ^(r) 



Set 



Set KY> = n\y 1 ns I (x). 

end if 
end for 
return 



The index defined in line 1 141 is the number of the last call in which the objective function V k 
has been considered. In the following we use the term round to denote an iteration of the for-loop 
starting in line [S] The index t r defined in line 1231 is the number of the round in call number r 
of Witnesso in which the recursive call of Witness was made. In line El we define the winner 
set C t using the functions V kl , . . . , V kt . Note that in round t = there is no restriction and this 
definition simplifies to — iz\ . In line I16l it is always possible to find an index from i £ Pk on 
which the current vector and x disagree because this line is only reached for k £ -K'neq, i-e., 
only if x( r '^\p k ^ x\p k . In order for line 1281 to be feasible, we have to guarantee that Pk\I =/= 0- 
This follows if we assume |Pfc| > d(d + 1) (which is in accordance with Lemma [231 without loss if 
generality) because there are at most d calls of Witness with non-empty K with at most d + 1 
rounds each and in each round at most one index from is added to /. 

One important property that we will exploit later is that every Pareto-optimal solution x is 
also Pareto-optimal with respect to the objective functions V k with k G J^neq if the set >S is 
restricted to solutions that agree with x in all positions i G UfeeA' EQ Pki i- e -> solutions z for which 
V k z = V k x for every k G Keq- This property guarantees that whenever Witness is called with 
a Pareto-optimal solution x as parameter, also in the recursive call in line[2Ha; is Pareto-optimal 
with respect to the restricted solution set and the remaining objective functions. 

In the remainder of this section we only consider the case that x is Pareto-optimal. Unless stated 
otherwise, we consider the case that the OKo-event occurs. That means that \V k ■ (y — z)\ > e for 
every k £ [d] and for any two vectors y,z £ S for which y\p k ^ z\p k . 

Lemma 25. The call Witness (V, x, [d], 1, ()) returns the set {x}. Moreover, for the 

last constructed vector x 1 -- 7 " »*»**), i.e., for r* = max{ri, . . . ,r<i}. 

Proof. We show that for all tuples K C [d] and / C [n] and any call number r the result of the call 
Witness (V, x, K, r, I) is the set {x} by induction over the size of K. If K is the empty tuple, then 
l?<d'+i = <S PI Si(x) n rifcefd] ^- e -' tne set ^d'+i' wn i cn wm be returned in hne|H consists of 

all solutions that are identical with x in every index. Consequently, TZ^+x = { x } m that case. 
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Now we consider non-empty tuples K C [d]. By the induction hypothesis it suffices to show 
that there is a call to the Witness function before the end of the loop. Hence, let us assume 
that there is no such call during the rounds t = d', d! — 1, . . . , 1. As for the simple variant of the 
Witness function we show two claims for that case: 

Claim 3. There is no t £ [d' + 1] such that there exists a solution z £ TZ-^ that dominates x with 
respect to the objective functions V kl , . . . , V kt . 

Claim 4. For any t £ [d' + 1] solution x is an element oflz[ r \ 

As there is no call to the Witness function during the first rounds we always get i^EQ = () 
whenever c[ r ^ ^ 0. Hence, the rounds are basically the same as the rounds in the simple Witness 
function and both claims hold due to the same arguments used to prove Lemma [5] and the fact 
that x £ Sj(x) for any index tuple J C [n\. 

Using Claim [3] and Claim [5] we can prove the lemma: Solution x is an element of C Q = 1Z\ 
according to Claim SJ In particular, ^ 0. Due to Claim [3] we get x £ Xq^ and, hence, 
yk lx (r,o) _ yk lx _ t ^ e OK -event holds, this implies x^ r '°^\p k — x\ Pki , leading to fci £ K E q. 
Consequently, the tuple K^q is not empty and we call the Witness function again. 

To conclude the proof of the lemma we have to show the equality In the call 

of the Witnesso function with number r* the last recursive call in line [24] is made. We know 
that x^ r , ' r *' ) |p = x\p k for any k £ [d] \ K where K is the parameter of call number r* because 

x {r*,t r ,) g *R}[ r \i Q C\ke[d\\K^ p k ( x )- Additionally, since the recursive call that is made in line I2"4l 
terminates immediately, all objective functions are removed from consideration in the call with 
number r* , i.e., -Keq = K in this call. Hence, the equation x^ r ' tT *'\ p — x\p k holds also for 

any k £ K. Together this implies x^ r = x. □ 

Like for the simple Witness function, we show that it is enough to know some information 
about one run of the Witness function to reconstruct the vector x. As before, we call this data 
the certificate of x. 

Definition 26. Let ri,...,^ and t\,...,t r *, where r* — max{ri, . . . , r^}, be the indices and 
let a;( r '*) be the vectors constructed during the execution of the call WitnessotV, x, [d], 1, ()) and 
let d' r be the value of d 1 in the r th call of the Witness function. Furthermore, consider the 
tuple I at the moment when the last call terminates. The pair (I* , A), where I* = JU (ij, . . . , i^j , 



min(_Pft- \ /), and A 



cO-' d 'i) . . . ,a; (1 '* l) , . . . ,x( r *> d 'r*\ . . . ,x (r "' t - 



, is called the V- 

certificate of x. We label the columns of A by a^ r,t \ Moreover, we call a pair (I', A') a certificate 
if there is some realization V such that OKo(V) is true and there exists a Pareto- optimal x £ S 
such that (I',A') is the V -certificate of x. By c & we denote the set of all certificates. 

We assume that the indices r% and t r (and hence also the indices d' r ) are implicitly encoded 
in a given certificate. Later we will take these indices into consideration again when we count the 
number of possible certificates. 

Lemma 27. Let V be a realization for which OKq(U) is true and let {L*,A) be a V -certificate of 



some Pareto- optimal x. Let A be of the form A 



a ( 1 ^),...,a( 1 ^),..., a (''*'<*),...,a(''*' t -*) 



For fixed k £ [d] set M 



a^ d 'i\...,a^\...,a [rk ' d '^\...,a^' t 



, where J = L* n P k 

J 
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(jij ■ • ■ ,jm), and set y = x\j. Then, M is of the form 

Wi Vh ■■■ Vh 

M = 



Vim-X Vj m - 



£{0,1} 



\J\x\J\ 



where each can either be or 1 independently of the other '^'-entries. 

Proof. Consider the call Witness (V, x, [d], 1, ()) and all subsequent calls Witness (V r , x, K, r, I). 
By definition of we have r < r^ k £ K. In each call where r < r^ one vector x^' 1 ^ 

is constructed each round. Also, in each round except for the last round of the rj(, h call one 
index i G P& is chosen and added to /. Since J consists of the chosen indices i G and the 
additional index matrix M is a square matrix. 

We first consider the last column of M. As the last vector constructed before k is 

removed from K , index k must be an element of K^q at that time, i.e., x^ rk,tr ^ |^ = x\p k . Hence, 
the last column of M has the claimed form. 

Now consider the remaining columns of M. Due to the construction of the set lZ t in line [21] 
or in line 1321 all vectors z in a set 1Z\ agree with x in the previously chosen indices i. As in the 
case c[ r ^ 7^ vector x( r '*) is an element of TZ^+i and in the case C\ r) = vector x^ is constructed 
appropriately, the upper triangle of M , excluding the principal diagonal, has the claimed form. 
The form of the principal diagonal follows from the choice of index i £ P^ such that x'f'^ ^ Xi in 
line [16] or from the construction of x^ r,t ' in line[3TJ □ 

As in the simple case we now consider the following Witness function that is based on the 
guess of the ^-certificate of x, a shift vector u 6 {0, 1}" and the £-box B — By{x — u). 

Witness (y, K, r, I*, A, S' , B, u) 

1: Let K be of the form K = (k\, . . . , kd'). 

2: Set k d , + i = d + 1. 

3: Let b be the corner of B. 

4: set n^^S'nutuS^')). 

if d! = then return 1Z 



5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 



d' + l 

for t = d',d' - 1,...,0 do 

Set C t (r) = {z G nSj.(a( r '*>) : F fel - fet ■(«-«)< &| fcl ...*J. 
if C 4 (r) ^ then 

Set A t (r) = argmin{^ fct + 1 2 : z G C t (r) }. 

Let a;( r, *' ) be the canonically first vector in x[ r \ 
if t = t r then 

Let i^EQ C be the tuple of indices k such that = r. 
Let i^NEQ C K be the tuple of the remaining indices, 
return Witness (y, # NE Q, r + 1, I*, A,S' n f\ e K EQ <$P fc (s< r '*>), -B, u) 
else 

Set 7^ r) = {ze TZgj n U'ri tr 5/. (a^)) : < V** 1 ^*)}. 

end if 
else 

Set x^ = (_L,...,_L). 

Set TZ ( t r) = TZ ( t % n \j\7l tr Sj, (a^'>). 
end if 
end for 
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23: return 

Lemma 28. Let (J*, A) be the V -certificate of x, let u e {0,1}™ be an arbitrary vector, and 
let B = By(x — u). Then, the call Witness (V, [d], 1, I*, A, S, B, u) returns {x}. 

Before we give a formal proof of Lemma [28] we try to give some intuition for it. As for the 
simple variant of the Witness function we restrict the set of solutions to vectors that look like the 
vectors we want to reconstruct in the next rounds of the current call, i.e., we intersect the current 
set with the set \Sv=t ^i* i a< " r ' t ') m round t. That way we only deal with subsets of the original 
sets, but we do not lose the vectors we want to reconstruct. In order to reconstruct the vectors, we 
need more information than in the simple variant: we need to know in which rounds the recursive 
calls of Witnesso are made, in each call we need to know which objective functions V k must not 
be considered anymore, and for each of these objective functions we need to know the vector x\p k . 
The information when the recursive calls are made and which objective functions must not be 
considered anymore is given in the certificate: The variable t r contains the round number when 
the recursive call is made. The index r^ contains the number of the call where index k has to be 
removed from K. Hence, index k is removed in the t^ round of call r^. If we can reconstruct Keq 
and the vector x^ r,t ^ in the round where we make the recursive call, then we can also reconstruct 
the bits of x at indices i G Pk for any index k £ i^EQ since x\p k — x^ r,i ^\p h for these indices k. 

Proof of\^ To prove the lemma we show that in the execution of Witness (V, [d], 1, I*, A, S, B, u) 
essentially the same sequence of vectors x' r,t ^ as in the execution of Witnesso(V, x, [d], 1, ()) is 
generated: 

Claim 5. Let Witness (V, x, K, r, /) be the r th call to the Witness function resulting from the 
first call Witness (V, x, [d], 1, ()). Consider the call Witness (V, K, r, L* , A, S' , B, u) where S' = 
^ ^ C\ke[d]\K Sp k (x) and B = By{x — u). If K = (), then both calls return {x}. Otherwise, the 

following statements hold, where TZ'} r \ C'} r \ X't , x'^ r,t \ K^q, and K^eq denote the variables 
from the call Witness (F, K, r, /*, A, S' , B, u): 

• TZ' t {r) C Tli r) for anyte{t r + l,...,d' + 1}, 

• z'M) = x (r,t) j or any t g _ _ f d >y j or which c (r) ^ 0) 

• x^ S TZ' t {r) for anyte{t r + l,...,d' + 1} and any t' G {t r , . . . , t - 1} for which C t (r) ^ 0, 
and 

• the next calls are both in round t r and are of the form Witness (V, x, K%, r + 1,/') and 
Witness (V,K 2 ,r + l,I*,A,S",B,u) where S" = S n C\ke[d]\K 2 S Pk ( x ) and Ki = K 2 . 

Using Claim [5j Lemma |2"51 follows immediately by an inductive argument. For K = () Claim [5] 
is trivially true. To show the claim for K ^ () we use induction over t. 

First consider the case t = dl + 1, i.e., the moment before entering the loop. By assump- 
tion and the construction of the vectors a^ r,t ^ we have T^jj~li = S' Pi Uf'=t ( a ' r '* ') = (S D 

C\ke[d]\K S P k ( x )) n yft'=t r S i* and Tiy +1 =5n5/(j;)n f)ke[d]\K S Pk ( x )i where 1 denotes 

the current index tuple. By construction, all vectors x( r,t ' agree with x in the indices i G I, and 
since I C I*, the relation TZ'j^ C K { $ +1 holds. As x^ G K { J, } +1 C S n C\ke{d]\K S Ph ( x ) = S ' for 

any index t' G {t r , ...,d'} where C { J } ^ and as x^ E S/» (a (r ^">), also x^ 6 TZ'j'f^ holds for 
these vectors. As we have not entered the loop yet, there is no recursive call for t = d' + 1. 

Now consider the case t < d' . We have C' t [r) = {z e Tl'^l n Sj^a^) : V kl - kt ■ {z - u) < 

b\kx...kt] and C [ t r) = {z e TZ^ : V kl - kt z < V kl - kt x). Since b is the corner of B = B v (x - u) 
and since the OKo-event occurs, the relations V k ■ (z — u) < bk and V k z < V k x are equivalent 
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for any index fc G [aj. Consequently, C t is a subset of C t as 7£ t \ i is a subset of 72-( +1 by the 
induction hypothesis. 

First, we analyze the case that ^ 0. This is in particular the case for t — t r due to the 
construction of t r . Due to the induction hypothesis, x^ r ^ is an element of TZ'}l{ and, due to the 

fact that x^ G 5j. (a^), an element of C' t (r) . Consequently, C' t {r) ^ and, thus, x'^ = x^\ 
If t > t r , then in neither of the calls a recursive call is made because K^q = () = K^q and the 

sets n' t (r) and n[ r) are defined as follows: H' t {r) = {z G H%\ n [ft~=t r S i* (a^* ) : V^z < 
V k t+lx (r,t)^ and n (r) = | z £ n {r)^ ^ Si(x) i < V" fct + 1 } . By the induction hypothesis 

and the construction of the vectors x^ r,t \ 7if t is a subset of IZ^ and x^' 1 ) G T?-^'' 1 for any i' G 
{i r , ...,£ — 1} where Q, ^ 0. If t r = t, then there is a call to Witness (V, x, .Kneq, t + 1,J) 
and a call to Witness (F, #neq> r + l,I*,A,S",B,u), where 5" = S' n flfceK^ 5 ft (^ (r,t) )- In 
accordance with the definition of the indices and the tuple K^q we obtain the equivalence 
k G -ft^EQ ''fc = f -<=> k G for any index k G K and thus ifgg = i^EQ and -Kneq = -Kneq- 
The definition of ^eq yields x^\ Pk = x\ Pk , i.e., f] keKEQ S Pk (x^) = f] k eK EQ S P k ( x )- As 
S' = S n n fee [ d ]\K ( x ) b y assumption, we finally get S" = S n n fce [d]\K NEQ 5 ft fa)- 

Now, let us analyze the case C ( (r) = 0, which implies C' t (r) = 0. Then, no recursive call is 
made and the sets TZ' t {r) and K { t r) are TZ' t {r) = K'£\ n Si* (a( r >*'>) and 7^ r) = n 5j(a;). 

The vectors x^ r, * r \ . . . , x^ r ' t ~ 1 ^ agree with a; in the indices i € I. Consequently, C 
and a;( r '*') G 7v for any t' G {t r , . . . , t — 1} as a;( r '*') G 5j* (a( r,t ')) and as this relation holds for 
lZ t+1 due to the induction hypothesis. This concludes the proof. □ 

By the choice of the vector u we can control which information about V has to be known in 
order to be able to execute the call Witness (V, [d], l,I*,A,S,B,u). While Lemma 1251 is correct 
for any choice of u G {0, 1}™, we have to choose u carefully in order for the following probabilistic 
analysis to work. Later we will see that u* — u*(I*,A), 

i G (»*,-.., ij), 

iei*\(*;,...,*5), (3) 

otherwise , 

is well-suited for our purpose. Note, that it G Pk are the indices that have been added to I in the 
definition of the ^-certificate to obtain I*. Furthermore, a;, is encoded in the last column of A for 
any index i £ I* . Hence, vector u* can be defined with the help of the ^-certificate of x; we do 
not have to know the vector x itself. 

Next we bound the number of Pareto-optimal solutions. For this, consider the following func- 
tions Xi" .a.b{V) parameterized by an arbitrary certificate (J*, A) G and an arbitrary e-box 
B G B e and defined as follows: XI',A,b( V ) = 1 if the call Witness (V, [d], 1, I*, A, S, B, u*(I*,A)) 
returns a set {x 1 } such that By{x' — u*(I*,A)) = B, and Xi*,A,b(V) — otherwise. 

Corollary 29. Assume that OKq(V) is true. Then, the number PO(V) of Pareto-optimal solutions 
is at most E(/-,A)e«'Es G B e Xi*,A,b{V). 

Proof. Let a; be a Pareto-optimal solution, let (J* , A) be the ^-certificate of x, and let B = By (x — 
u*(I*,A)) G B £ . Due to Lemma 1251 Witness (V, [d], 1, I* , A, S , B , u* (I* , A)) returns {x}. Hence, 
Xi*,A,b(V) = 1. It remains to show that the function x n> (I*, A, B') is injective. Let x\ and Xi be 
distinct Pareto-optimal solutions and let (I*,Ai) and {1%, A^) be the ^-certificates of X\ and X2, 
respectively. If (I*, Ax) ^ (J|,j42), then x% and x-i are mapped to distinct triplets. Otherwise, 
u*(I*, Ax) = u*(I$,A 2 ) and, hence, B v {x t -«*(/*, A x )) ^ B v (x 2 - u*(I$, A 2 j) because OK (^) 
is true. Consequently, also in this case x\ and x 2 are mapped to distinct triplets. □ 



xj 

U i — < Xi 





2<S 



Corollary [29] immediately implies a bound on the expected number of Pareto-optimal solutions. 



Corollary 30. The expected number of Pareto-optimal solutions is bounded by 



Ey [PO(V)] < J2 J2 Prv 1 E I',a,b] + 2 n • Pr v [OK (V) 

(/*,A)e^seB e 

where Ei* t A,B denotes the event that the call Witnesso(VJ [d], 1, I*, A, S, B, u*(I*, A)) returns a 
set {x'} such that B v (a/ - u* (J* , A)) = B. 



Proof. Using Corollary 1 2 9 1 we obtain 



Ey [PO(V)] = Ey [PO(V) I OKo(V)] • Pry [OK (V)] + Ey PO(V) | OK (V) 



Pry 



OK (V) 



< E v 



£ E Xi;a,b(V) I OKo(V) 

(/*,A)G<i!?B(EB e 



2" • Pr, 



■Pry[OK (V)]+2"-Pry OK (V) 



OK (V) 



El J2 J2 Xi;a,b{V) 

]T E Pl > \- E i* >Ab] + 2" • Pry [OK (V) 

(7*,A)e^BGB e 



□ 



We will see that the first term of the sum in Corollary [301 can be bounded independently of e 
and that the limit of the second term is for e — > 0. First of all, we analyze the size of the 
certificate space. 

Lemma 31. The size of the certificate space is bounded by \^\ < 2 < - d+1 ^° ■ d 2d+3 ■ n d ~'( d+1 ^> = 

Proof. Consider the execution of the call Witness (V, x, [d],l, ()). Including that call, there can be 
at most d calls to the Witness function except for the call that terminates due to d' — as in each 
of the other calls at least one index k £ [d] is removed from the tuple K. Hence, ri, ...,r<j € [d] 
and there are at most d d possibilities to choose these numbers. In the r th call, the round number t r 
is an element of [d' r ]o C [d] 0j and hence, there are at most (d + l) d possibilities to choose round 
numbers t±, . . . , t r * . In each round, at most d indices % are added to the tuple /. In total, tuple / 
contains at most d 2 ■ (d+ 1) indices, and hence, there are at most d 2 ■ (d+ 1) ■ n d ( <i + 1 ) choices for /. 
Once I is fixed also the indices in I* \ I are fixed because the indices added to / in Definition [25] 
are determined by /. The tuple I* contains at most d 3 + d 2 + d indices. In each call r, at most d+1 
vectors x^^ are generated. Hence, matrix A has at most d-(d+ 1) columns and at most d 3 + d 2 +d 
rows. This yields the desired bound 

|tf | < d d ■ (d + l) d -d 2 -(d+l)- n d2 < d+ V . 2 d i d +V< d3 + d2 + d ) 

< 2 d ■ d 2d ■ 2d 3 ■ n d2 ' (d+1) • 2 d2 '( d2 + d+1 H <i+1 ) 

< 2 (d+1)5 • d 2d+3 ■ n d2 ' l - d+1) . □ 



In the next step we analyze how much information of V is needed in order to perform the 
call Witness (V, [d], 1,1*, A, S, B, u). We will see that V does not need to be revealed completely 
and that some randomness remains even after the necessary information to perform the call has 
been revealed. This is the key observation for analyzing the probability Pry For this, 

let V be an arbitrary realization, i.e., we do not condition on the OKo-event anymore, and fix an 
index k G [d]. Only the indices i G Pk are relevant for function V k . Hence, we set 1% = I* D Pk 
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and we assume that the coefficients of V k belonging to indices i £ i£ are known. We denote this 
part of V k by V-i± and concentrate on the remaining part of V k which we denote by V k *. By 
the construction of index r/~ we know that only in the calls r = 1, . . . , r& information about the 
function V k must be available as in all subsequent calls this function is not considered anymore. 



Since in the r th call we consider only vectors that agree with the vectors x^ r ' tr ^ , 



..,x 



(r,d' r ) 



(see 



line H]) in the indices i G I*, it suffices to know all linear combinations V k , ■ (a ( - r, *'|/» — u|/»J. 
However, in call we would reveal too much information about x such that there would be no 
randomness left. Therefore, we reveal those linear combinations for all calls r E [rt — 1] and analyze 
call rfc more detailed like in the case of non-zero-preserving perturbations: 

Consider call number and let jk G [d' rk ] be the index such that kj k = k in round r^, i.e., 
V k is the jf 1 objective function in K in round . For this call the inequality t Tk < jk — 1 



holds. This is due to the fact that x < - Tk ' tr k ) \ 



-,\p k , i.e., V k x^ rk ' tr k' > = V k x, since this is the 



round when k is removed from tuple K . On the other hand, x 



(r»,trJ 



(ru) 

ecy k \ which means that 



yk t , x {r k ,t rk ) < yk t , x f or a rj i nc |i ces P — i 5 . . . ,t rk , where ki,...,kd' denote the indices tuple K 
consists of in round r^. Hence, t Tk < jk as k = kj k . 

There are only three lines where information about V is required: line[71 lineal and line ll6l For 



line [7] the values V k » 



I* — u \i k ) from round t = d! rk down to round jk are needed. In line [9] 



no additional information about V k , is needed since all considered vectors agree on indices ieJ^ 
with each other. For line [16] only in round jk — 1 the values V k , 



7 (r k ,j k ~l) 



. , jk — 2 are required. We write the linear combinations of V k , of the calls r = 1, 



and of call into the matrices Pk and Qk and obtain 



Pk = 



,(Mi) 



( r k -d'r- 



U\l- 



,(l,*l) 



U J. 



. ,a 



a 



u\i*,. 

(r k ,j k -l) 



,a 



(r»-l,t r ._i) 



ft 



. , r-fe - 1 
and 



Using the notation p k 



(r,t) 



,(r,t) 



u\i* we can write both matrices as 



Qk 



(Mi) 
Pk 



Pk 



(Mi) 
iPfc > 



'Pk 



(r k -l,d' r _ t ) 



(r*-l,i rfc _i) 



„(r fc ,j»,-2) 
'Pk 



K-jfc-i) 



(r- fc ,trj 



and 



OWfe-1) 



Note that the matrices = Pk(I*, A, u) and = Qk{I* , A, u) depend on the choice of u. 

Corollary 32. Let u G {0,1}" be an arbitrary shift vector, let (I* , A) G ^ be an arbitrary cer- 
tificate, and let V and W be two realizations such that V^- = W£- and Vh ■ q = WK ■ q for any 

1 k 1 k k k 

index k G [d] and any column q of one of the matrices Pk{I* , A,u) and Qk{I* , A,u). Then, for 
any e-box B G B e the calls Witness (V, [d], 1, /*, A, S, B, u) and Witness (M / , [d], 1, 7*, A, S, B, u) 
return the same result. 

In the remainder of this section we assume that VJ^- and the e-box B are fixed. In accordance 

with Corollary the output of the call Witness (V, [d], 1, /*, A, S, B, u) is determined if the 
linear combinations of Vh given by the columns of the matrices Pk and Qk are known in addition. 

k 

We are interested in the event Ei* t A,B, i-e., in the event that the output is a set {x'} such that 
yi...d . (^.i _ u *(j*^A)) G B. Since (I*, A) is a ^-certificate of x for some V and x, the output 
is always of the form {x'} due to Lemma |2"51 Hence, event S/»,a,s occurs if and only if for all 
indices A; the relation V k , ■ (x' — u*(I*, A)) I „ G Ck holds for some interval Ck of length e that 

™ k 

depends on the linear combinations of Vi* given by Pi and Qi for all indices £ G [d]. 
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Lemma 33. For any fixed index k € [d] the columns of matrix Pk(l*,A,u*(I*,A)j, of matrix 
Qk(l*,A,u*(I*,A)), and the vector p^*"^^ are linearly independent. 

Proof. Consider the square matrix Q k consisting of the vectors pi j for r E [rk] and t 6 
{t r , . . . , d' r }. Due to Lemma [27] and due to the construction of u*(I*,A) (see Equation [3]) ma- 
trix Qk is a lower triangular matrix and the elements of the principal diagonal are from the 
set { — 1, 1}. Hence, | detQfc = 1, i.e., the vectors p[ r,t ' are linearly independent. 

The columns of matrix Pk are pairwise distinct columns of matrix Q k . The columns of ma- 
trix Qk and vector p k rk ' trh ^ are linear combinations of the vectors p k rk ' trk \ . . . ,p k k ' rk> . As the 

(r t) 

vectors p k ' are linearly independent, it suffices to show that the columns of matrix Q k and vec- 
tor p k ' rk are linearly independent. For this consider an arbitrary linear combination of the 
columns of matrix Q k and vector pj c '*'"'* rfc '' that computes to zero: 



t=3k 



(r k ,t) 



E % 



(*) 



(r fc ,t) 



Pk 



. (r k ,t ) 

+ Hk-P k = 



If t rk = jk — 1 , then this equation is equivalent to 



X k 'Pk +^k-p k 



{r k ,t T 



t=t r .+l 



i.e., all coefficients are zero due to the linear independence of the vectors p[ r • If t rk < j k — 1, 
then the equation is equivalent to 



E4 j 

t=j k 



(*) Jr k ,t) 
' Pk 



3 k -2 



E A^. f) • 



3k -2 



■Pk 



. J r k,3k- 
Pk 



t=t r .+l 



(rk,t r .) 



0. 



Due to the linear independence of the vectors p k ' this implies X k — for t G {i rjfc + 1 , . . . , jk — 
2} U {jk, ■ ■ ■ , d' rk }, X)t=7 2 ^k * 1 = 0' ano - ^fe +Mfe = and we obtain that all coefficients are zero. 
In both cases, the linear independence of the columns of Qk and the vector p( Tk > tr k } follows. □ 



Corollary 34. Let (I*, A) G be an arbitrary certificate and let B S 
Then, 



Pry [E_ 



I*,A.B 



<(27) 



and 



Ptv[ e i;a,b] <2 d -^- d 
in case of quasiconcave densities, where 7 = d + d 2 + d. 



b d -e d 



be an arbitrary e-box. 



Proof. For fixed index k € [d] we write the columns of matrix P kl of matrix Qk, and vector p k 
into one matrix Q k and consider the matrix 



(r k -t ) 



Qi 



Q' = 



€{-1,0,1} 



31 



for some integer I < \I*\. By Lemma |33lthis matrix has (full) rank I. Now, permute the columns 
of Q' to obtain a matrix Q where the last d columns belong to the last column of one of the 
matrices Qk- For any index i £ I* let X{ = V k be the i th coefficient of V k where k is the unique 
index such that i £ P k . Event Er- t A,B holds if and only if the d linear combinations of the 
variables Xi given by the last d columns of Q fall into a rf-dimcnsional hypercube C depending 
on the linear combinations of the variables Xi given by the remaining columns. The claim follows 
by applying Theorem [35] for matrix A = Q T and due to the fact that |J*| < 7 (see proof of 
Lemma EH)- □ 



Proof of Theorem [7[ We begin the proof by showing that the OKo-event is likely to happen. For 
any index t £ [d] and any solutions x, y £ S such that x\p t 7^ y\p t the probability that \V t x — 
V t y\ < e is bounded by 2<fts. To see this, choose one index i £ P t such that x% 7^ yi and fix all 
coefficients V? for j 7^ i. Then, the value V* must fall into an interval of length 2e. A union 
bound over all indices t £ [d] and over all pairs (x,y) £ S x S such that x\p t 7^ y\p t yields 

< 2 2n+1 d(f>e. For 7 = d 3 + d 2 + d, we set s = (2~f) 1 ~ d ■ <jp for general density 



Pr 



OKq(^) 



functions and s = 2 d ■ 7 7 d ■ cf> d in the case of quasiconcave density functions. Then, we obtain 



E v [PO(F)]< £ Pr v [Ep^ B ]+2 n -Pr v [OK (V) 

(J*,A)G^SeB E 

< s ■ £d + 2 " • 22n+1 ^ £ 



< 2 (d+1)5 ■ d 2d+3 ■ n d2 ' (d+1) 



2n 



s . £ d + 2 3n+1 d6e 



2(d+l) 5 +d . ffd+3 _ n d 3 +d 2 +d _ g , 2^n+l 



tie . 



The first inequality is due to Corollary [30l The second inequality is due to Corollary [34j The 
third inequality stems from Lemma 1311 Since this bound is true for arbitrarily small e > 0, the 
correctness of Theorem Q] follows. □ 



6 Some Probability Theory 

Theorem 35. Let Xx,...,X n be independent random variables, each with a probability density 
function /j : [—1,1] — > [0,0]. Let m £ [n], let A £ {—1,0, l} m ™ be a matrix of full rank, let k £ 
[rn — 1] be an integer, let (Yi, . . . ,Y m ^k, Z\, . . . , Zk) T = A-(X\, . . . ,X n ) be the linear combinations 
of Xi, . . . , X n given by A, and let C be a function mapping a tuple (y±, . . . , y m —k) £ R m ~ fc to a 
hypercube C(y±, . . . , y m —k) Q with side length e. Then, 

Pr [(Z u . . . , Z k ) £ C(Y U . . . , F m _ fc )] < (2n) n - k ■ 0" ■ e k . 

If all densities fi are quasiconcave, then even the stronger bound 

Pr [(Z 1: ...,Z k )€C(Yi,..., Y m _ k )} < 2 k ■ n n ~ k ■ <j> k ■ e k 

holds. 

Proof. Without loss of generality let m = n, i.e., matrix A is a square matrix where det A 7^ 0. 
Otherwise, we could insert m — n linearly independent rows from {0, 1}™ after row to — k until 
matrix A is of that form. That way we get new random variables Y m -k+i, ■ ■ ■ , Yn-k- Then, we can 
extend the domain of C to R"~ fc by the definition C(yi, . . . , y n -k) ■— C(z/i, • ■ • , 2/m-fe); which shows 
that it is sufficient to deal with the case m = n because Pr [{Z\, . . . , Z k ) £ C(Yi, . . . , Y m - k )] = 

Pr[(Zi,...,z fc )ec(r 1 ,...,y n _ fc )]. 
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As matrix A is a full-rank square matrix, its inverse A 1 exists and we can write 
Pr[(Z 1 ,...,Z k ) GC(Y 1 ,...,Y n _ k )}= [ f f YtZ (y,z)dzdy 

= [ I \Aet{A- 1 )\- f x {A- 1 -{y^^dzdy 

JyER n - k JzeC(y) 

<e k - f max f x (A- 1 -(y,z))dy, 

J vm n- k z£R k 



where fy,z denotes the common density of the variables Y\ , . . . , Y n _k , Z\, . . . , Z k and fx = 11™= 1 /* 
denotes the common density of the variables X\ , . . . , X n . The second equality is due to a change 
of variables, the first inequality stems from the fact that | det (^4 _1 )| = |l/dct A\ G (0, 1) since A 
is an integer matrix. 

In general, we can bound the integral in the formula above by 

/ max.f x (A _1 • [y,z))dy < / max f x (A^ 1 ■ (y, z))dy 

: f <j) n dy = (2n) n - k - * n 



where the first inequality is due to the fact that all variables Yi can only take values in the 
interval [— n,n] as all entries of matrix A are from { — 1,0, 1}. 

To prove the statement about quasiconcave functions we first consider arbitrary rectangular 
functions, i.e., functions that are constant on a given interval, an zero otherwise. This will be the 
main part of our analysis. Afterwards, we analyze sums of rectangular functions and, finally, we 
show that quasiconcave functions can be approximated by such sums. 



Lemma 36. For i G [n] let <pi > 0, let Ii CM. be an interval of length i i} and let fi : R — > R be the 
function 

4>i ■ X G Ii, 
: otherwise. 



fi(x) 



Moreover, let f : R™ — > R be the function f(x\, . . . , x n ) — 11™= l fi( x i) an d fet A G { — 1, 0, 1}™ X ™ be 
an invertible matrix. Then, 



L 



yeR n- k zeR k 



max /(A- 1 • (y, z))dy < 2 k ■ (n - k)\ ■ X ■ ]T II ' 



/ i4I 



where \ — 11™= l 0» an ^ where the sum runs over all tuples I = . . . ,i k ) where 1 < i\ < . . . < 
ik < n. 

Proof. Function / takes the value \ 011 the n-dimensional hyper-cuboid Q = n"=i h an d i s zer0 
otherwise. Hence, 

/ max/^- 1 • {y,z))dy = x - vol(Q') 

for 

Q' = {y G R n_fc : 3z G M. k such that A~ x ■ (y, z) G Q) 
= {y G R"~ fc : 3z G R k 3x G Q such that (y, z) = A ■ x) 
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= (P-A)(Q), 

where P = [in-fc, Q( n — fc) X fc] is the projection matrix that removes the last k entries from a vector 
of length n. In the remainder of this proof we bound the volume of M (Q) where M = P ■ A G 
{—1,0,1} fe ) x ™ L e ^ ai — ■ c o an( j ft. —. c i fag } e ft an d the right bound of interval 2j, 
respectively. For an index tuple / = {i\, . . . ,i k ), 1 < ii < . . . < ifc < n, and a bit tuple J = 
(ji.---.Jfc) 6 {0,l} fc , let 



be one of the 2 k ■ (™) (n - fc)-dimensional faces of Q. We show that M(Q) C U/Uj-M' (-*/)• 
Let y £ M(Q), i.e., there is a vector ieQ such that y = M ■ x. Now, consider the polytope 

i? = {(a/, s') efx 1" : M ■ x' = y', x' + s' = 6', and x' , s' > 0} , 

where y' = y — Al ■ a and b' = b — a for a = (oi, . . . , a„) and b = (b±, . . . , b n ). This polytope is 
bounded and non-empty since (x , s ) 6 i? for .t' = a; — a and s' — b ~ x. Consequently, there exists 
a basic feasible solution (x*,s*). As there are 2n variables and 2n — k constraints, this solution 
has at least k zero-entries, i.e., there are indices 1 < %\ < . . . < i k < n such that either x* t = 
(in that case set ft = 0) or x* t = b\ (in that case set j t = 1) for any t £ [k]. Now, consider the 
vector x = x* + a 6 [0, b'] + a = Q. We obtain M ■ x = y and Xi t = c?' for all t G [k]. Hence, 
x e F/ for J = (ii, . ..,i k ) and J = (ji, . . . ,j k ), and thus y 6 M (F/) . 

Due to this observation we can bound the volume of M{Q) by ^ 7 vol(M (F/)) ■ It remains 
to show how to bound the volume vol(M(F/ j). For the sake of simplicity we only consider / = 
(n — k+ 1, . . . , n) in the following analysis. Let <fi J : — > F/ be the function 4> J (x) — T ■ x + v J , 

where T = [l n - k , 0(„_ fc ) xfe ] T and v J = (0, . . . , 0, c£_ k+1 , . . . , c^ fc ) . Using function <?!> J is the 
canonical way to describe the affine subspace defined by face F/: it adds the fixed coordinates 
of Fj to a given vector of length n — k. Hence, function </> J , restricted to the domain F' = n^Ti ^ > 
is bijective. With ip — M o (j> J we obtain 

vol(M(F/)) = / ldx= [ \detDip(x)\dx= [ \ det(M ■ T)\dx 
JtP(F') Jf' Jf' 

n—k 

= | det(M • T)\ • vol(F') = | det(M • T)| • JJ 4 . 

i=i 

Matrix M ■ T = P ■ A ■ T is a (n - k) x (n - fc)-submatrix of A. Thus, | det(M -T)| < (ra - fc)!, and 
we obtain the bound 

/ max /(A" 1 • (y, z))dy = x ■ vol(M(Q)) < X • ^ E vo1(M(F/)) 

-2 fc .(n-fc)!-x-^[]^- D 

In the next step we generalize the statement of Lemma [35] to sums of rectangular functions. 

Corollary 37. Lei iVi, . . . , N n be positive integers, let cf>i^ k > be a non-negative real, let Ii k C K 
6e an interval of length £i tk , and let fi k : K — >• K be the function 



fi,k(x) 



: otherwise . 
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i € [n], k s [Afj]. Furthermore, let ft: K — > K 6e the function /, = X)fc^=i /»>fe> ^ /• ^™ ~ ^ ^ ^ e ^ e 
function f(x\, . . . ,x n ) = J\7—i fi{ x i)> an d ^ A G {—1,0, 1}™ X ™ fr e arl invertible matrix. Then, 

f m^f(A~ 1 .(y,z))dy<2 k -(n-ky.-J2 (]!<* ) ' (l[xi) 

where cr, = Xwb=i ^ii* ' 4,fc anc ^ Xi = 2fe=i 'fefe ani ^ where the first sum runs over all tuples I = 
(ii, . . . ,ik) for which 1 < ix < . . . < ik < n. 

Proof. For indices fej £ [Ni] let fk u ...,k n ( x i, ■ ■ ■ , x n ) — Y[7=i /*,** This lunction is of the 

form assumed in Lemma |36l and takes only values zero and Xfei,...,fc„ — 11™= 1 We can write 

function / as 

n n Ni Ni N„ n 

f{x 1 ,...,X n ) = '[[f l (x l ) = Y[ E fiM( x i) = E E II 
i=l i=l fci=l fci = l fe„=li=l 

iVi AT„ 
= E ••• X /fci,...,fcn( a; l)---) a; n)- 

fc 1= l fc„=l 

For the sake of simplicity we sometimes write ( Ylkj=i )je(ji u) ms * ea< ^ °^ Efc^i ■ ■ ■ Sfe/ ==i an< ^ 
we drop the bounds of the sums if they are clear. Now, we can bound the integral as follows: 

/ ™uc/( A_1 • (»> = / mgE ••• E fkx,.,^- 1 ■ (y, z))dy 

<E - E / max/fei,...,^^" 1 ' (y^))dy 

Nx N n / \ 

^ E - E 2 M«^)4 tl t-EII^ 

fe 1= l fe„=l \ I 10 I 

= 2 fe . (n k )\ ■ J2 ( E ) f II • II E • 4.0 

= 2 k ■ (n - k)\ ■ x ( n e ■ £ ^ • ( e ) n 

/ \i0/ fei V fei / ieJ ie/ 

= 2 fe ■ (n - *)i ■ x f n e • ^ ■ n E ^iA ) 

/ Vi^/ fc* ie/ fc* / 

=2 fe -(--fc)!-Efn^-fnO . 

/ J \iei / 

where the second inequality is due to Lemma 1361 □ 

To finish the proof of Theorem [35] we round the probability densities fi as follows: For an 
arbitrarily small positive real S let := \fi/S~\ -5, i.e., we round /j up to the next integral multiple 
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of 5. As the densities /, are quasiconcave, there is a decomposition of gi such that gi = X)fc=i fi,k 
where 

(6x1 Ni 
^ k = { I otherwise, md Xi == g&,* = ^^ito , 

where fa^ are intervals of length and are positive reals. The second property is the 
interesting one and stems from the quasiconcaveness of fa. Informally speaking the two-dimensional 
shape bounded by the horizontal axis and the graph of gi is a stack of rectangles aligned with axes. 
Therefore, the sum Xi of the rectangles' heights which appears in the formula of Corollary 1371 is 
approximately 6. Without the quasiconcaveness Xi might be unbounded. 
Applying Corollary I3"T1 we obtain 

/ max/* (A -1 .(y,z))dy= f max JT fa ((A^ 1 ■ (y, z)) )dy 

~ n 

^ / ma^fU^A" 1 • (y,z)) )dy 

< 2 k ■ (n - ky. ■ j2 ( n e ■ • ( n xi] 

I \i£/fc; = l / Kiel ) 

-"■ ( "-* ), -?fe/-M"*')-(5 Ja )- 

Since < J[_ ls] 9idx < /r_ X)1 i(/i +S)dx = l + 25 and < Xi < s uP:re[-i,i] + <5 < + <5, this 
implies 

/ max/ x (A" 1 .(y,z))dj;<2 fc .(n-fc)!.^ (n(l + 25) ) ■ (n^ + 5 )) 

= 2 k • (n - fc)! • ^(1 + 2 < 5)"^ fc • (0 + <5) fc 

= 2 fe • (n - fc)! • Q • (1 + 2<5)"~ fc • (0 + 

< 2 fc • n"- fc • (1 + 2S) n - k ■ (6 + S) k . 

As this bound is true for arbitrarily small reals S > 0, we obtain the desired bound. □ 

7 Conclusions and Open Problems 

With the techniques developed in this paper we settled two questions posed by Moitra and 
O'Donnell [13]: For quasiconcave densities we showed that the exponent of <f> in the bound for the 
smoothed number of Pareto-optimal solutions is exactly d. Moreover, we significantly improved on 
the previously best known bound for higher moments of the smoothed number of Pareto-optima 
by Roglin and Teng [T7] . 

Maybe even more interesting are our results for the model of zero-preserving perturbations 
suggested by Spielman and Teng [3D] and Beier and Vocking [3]. For this model we proved the 
first non-trivial bound on the smoothed number of Pareto-optimal solutions. We showed that 
this result can be used to analyze multiobjective optimization problems with polynomial and even 
more general objective functions. Furthermore, our result implies that the smoothed running time 
of the algorithm proposed by Berger et al. [5 to compute a path trade in a routing network is 
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polynomially bounded for any constant number of autonomous systems. We believe that there are 
many more such applications of our result in the area of multi-objective optimization. 

There are several interesting open questions. First of all it would be interesting to find asymp- 
totically tight bounds for the smoothed number of Pareto-optimal solutions. There is still a gap 
between our upper bound of 0(n 2d cf> d ) for quasiconcave (/>-smooth instances and the best lower 
bound of Vt,(n d ~ 15 (f) d ) [TU]. Only for the case d = 1, we could show that the upper bound is 
tight [6]. 

Especially for zero-preserving perturbations there is still a lot of work to do. We conjecture 
that our techniques can be extended to also bound higher moments of the smoothed number of 
Pareto-optima for </>-smooth instances with zero-preserving perturbations. However, we feel that 
even our bound for the first moment is too pessimistic as we do not have a lower bound showing that 
setting coefficients to zero can lead to larger Pareto-sets. It would be very interesting to either 
prove a lower bound that shows that zero-preserving perturbations can lead to larger Pareto- 
sets than non-zero-preserving perturbations or to prove a better upper bound for zero-preserving 
perturbations. 

It would also be interesting to know whether our results extend to sets S C {— m, . . . , to}™ of 
solutions for a constant to > 1. We conjecture that our results still hold with worse constants that 
depend on to. 
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